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Description 

This application is a continuation-in-part of Looker and Hoffman, U.S. Ser. No. 07/374,161, DI-ALPHA AND D I- BETA 
GLOBIN LIKE POLYPEPTIDES AND USES THEREFOR, filed June 30, 1989; Stetler and Wagenbach, U.S Ser No 
07/379,1 16, PRODUCTION OF HUMAN HEMOGLOBIN BY TRANSFORMED YEAST CELLS, filed July 13, 1989; and 
Hoffman, Looker, Rosendahl and Stetler, U.S. Ser. No. 07/349,623, POLYCISTRONIC CO-EXPRESSION OF THE 
ALPHA- AND BETA GLOBINS AND IN VIVO ASSEMBLY OF BIOLOGICALLY ACTIVE, TETRAMERIC HEMOGLOBIN, 
filed May 10, 1989; all owned by Somatogenetics International, Inc. 

Cross- Reference to Related Applications 

Hoffman and Nagai, U.S. Ser. No. 07/194,338, filed May 10, 1988 t presently owned by Somatogenetics international, 
Inc., relates to the use of low oxygen affinity mutant hemoglobins as blood substitutes, and to the expression of alpha 
and beta globin in nonerythroid cells. 

Field of the Invention 

The present invention relates to the intracellular assembly of a human hemoglobin-like protein in biologically func- 
tional, substantially soluble form through co-expression of alpha- and beta-globin-Iike polypeptides in bacterial or yeast 
cells. 

It further relates to the genetic fusing of the two alpha subunits of hemoglobin to form a novel polypeptide, di-alpha 
globin, which may be considered a partially assembled intermediate leading to a hemoglobin-like protein, and the use 
of this compound in the production of synthetic hemoglobins having an increased intravascular half-life as compared to 
stroma-free hemoglobins. It also relates to the analogous polypeptide di-beta globin. 

Information Disclosure Statement 

It is not always practical to transfuse a patient with donated blood. In these situations, use of a red blood cell substitute 
is desirable. The product must effectively transport 0 2 , just as do red blood cells. ("Plasma expanders", such as dextran 
and albumin, do not transport oxygen.) The two types of substitutes that have been studied most extensively are hemo- 
globin solutions and fluorocarbon emulsions. 

Structure and Function of Hemoglobin 

Hemoglobin (Hgb) is the oxygen-carrying component of blood. Hemoglobin circulates through the bloodstream 
inside small enucleate cells called erythrocytes (red blood cells). Hemoglobin is a protein constructed from four asso- 
ciated polypeptide chains, and bearing prosthetic groups known as hemes. The erythrocyte helps maintain hemoglobin 
in its reduced, functional form. The heme iron atom is susceptible to oxidation, but may be reduced again by one of two 
enzyme systems within the erythrocyte, the cytochrome b 5 and glutathione reduction systems. 

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2, because it comprises two 
alpha and two beta chains). The alpha chain consists of 141 amino acids. The iron atom of the heme (ferroprotoporphyrin 
IX) group is bound covalently to the imidazole of His 87 (the "proximal histidine"). The beta chain is 146 residues long 
and heme is bound to it at His 92. Apohemoglobin is the heme-free analogue of hemoglobin; it exists predominantly as 
the ap-globin dimer. 

Separated, heme-free, alpha-and beta-globins have been prepared from the heme-containing alpha and beta sub- 
units of hemoglobin. The separated heme-free globin chains are folded very differently, even though the heme-containing 
subunits are highly similar in secondary structure and basic folding features. This shows that the binding of the prosthetic 
heme group to globin subunits has quite different effects on alpha and beta globin. Yip, et al. ( J. Biol. Chem., 247: 7237- 
44(1972). 

Native human hemoglobin has been fully reconstituted from separated heme-free alpha and beta globin and from 
hernia Preferably, heme is first added to the alpha-globin subunit. The heme-bound alpha globin is then complexed to 
the heme-free beta subunit. Finally, heme is added to the half-filled globin dimer. and tetrameric hemoglobin is obtained. 
Yip, et al., PNAS (USA), 74: 64-68 (1977). 

In cell-free systems prepared from unfractionated rabbit reticulocyte hemolysates. globin is actively synthesized for 
approximately five minutes, and then protein synthesis abruptly ceases. Prior addition of hemin prevents or delays the 
cessation of synthetic activity, as a result of the effect of hemin on an inhibitory protein known as "hemin-regulated 
inhibitor" (HRI). Hemin deficiency has a more severe effect on alpha chain synthesis than on beta chain synthesis as 
alpha globin mRNA is less efficient than beta-globin mRNA in initiating polypeptide chain synthesis. It has been spec- 
ulated that alpha chains are released from their site of synthesis only in the presence of free beta chains, which imme- 
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diately complex the released alpha chains to form ap-globin dimers. These then combine with heme to form tetrameric 
hemoglobin Wmterhalter and Huehns, J. Biol. Chem., 239: 3699 (1964). It is certainly known that the addition of heme 
to ap-giobm doners (apohemoglobin) leads to the rapid formation of hemoglobin. 

The human alpha and beta globin genes reside on chromosomes 1 6 and 1 1 , respectively. Bunn and Forget, supra 
5 at 1 72 Both genes have been cloned and sequenced, Liebhaber, et al. , PNAS 77: 7054-58 ( 1 980) (alpha-globin genomic 
DNA). Maroria e* at J B*dI Chem., 252: 5040-53 (1977) (beta globin cDNA); Lawn, et al., Cell, 21 :647 (1980) (beta 
globin genome DNA) 

Herrx>gk3t4n eif^txts cooperative binding of oxygen by the four subunits of the hemoglobin molecule (two alpha- 
globins anc t\*o tela giDtans in me case of Hgb A), and this cooperativity greatly facilitates efficient oxygen transport. 
w Cooperative ac^eved Oy the so-called heme-heme interaction, allows hemoglobin to vary its affinity for oxygen. Hemo- 
globin revels** * t*nos up to tour moles of oxygen per mole of Hgb. 

Oxygen can ,ing compounds are frequently compared by means of a device known as an oxygen dissociation curve. 
This curve is obtained *nen, for a given oxygen carrier, oxygen saturation or content is graphed against the partial 
pressure c* oiyven Fo H^b. the percentage of saturation increases with partial pressure according to a sigmoid rela- 
ys tionship. Tbt? Pv> Vh? pc*t t**i pressure at which the oxygen-carrying solution is half saturated with oxygen. It is thus a 
measure of o*rv*° t* r *>ng affinity, the higher the P50, the more loosely the oxygen is held. 

When the o» *gen ossocMtion curve of an oxygen-carrying solution is such that the P50 is less than that for whole 
blood, it is sa>d to be "left sotted * 

The oxygen ati*n*t> of *>omog*obin is lowered by the presence of 2,3-diphosphoglycerate (2,3-DPG), chloride ions 
20 and hydrogen one F <~4>r mg troche releases carbon dioxide into the blood and lowers its pH (i.e. increases the hydrogen 
ion concentrat on) the* eby caj&tng oxygen to dissociate from hemoglobin and allowing it to diffuse into individual cells. 

The ability of nerojioexn to aher its oxygen affinity, increasing the efficiency of oxygen transport around the body, 
is dependent on n* oesence of the metabolite 2,3-DPG. Inside the erythrocyte 2,3-DPG is present at a concentration 
nearly as great »<; th*t of h*>mogjobin itself. In the absence of 2,3-DPG "conventional" hemoglobin binds oxygen very 
25 tightly and would riMpase little oxygen to respiring tissue. 

Aging erythrocytes release small amounts of free hemoglobin into the blood plasma where it is rapidly bound by 
the scavenging protein haptoglobin. The hemoglobin-haptoglobin complex is removed from the blood and degraded by 
the spleen and hvef 

30 Blood Substitutes. Generally 

It is clear from the above considerations that free native hemoglobin A, injected directly into the bloodstream, would 
not support efficient oxygen transport about the body. The essential allosteric regulator 2,3-DPG is not present in suffi- 
cient concentration in the plasma to allow hemoglobin to release much oxygen at venous oxygen tension. 

35 Nonetheless, solutions ol conventional hemoglobin have been used as RBC substitutes. The classic method of 
preparing hemoglobin solutions employs outdated blood. The red cells are lysed and cellular debris is removed, leaving 
what is hopefully "stromal-free hemoglobin" (SFH). 

Several basic problems have been observed with this approach. The solution must be freed of any toxic components 
of the red cell membrane without resorting to cumbersome and tedious procedures which would discourage large-scale 

40 production. DeVenuto, "Appraisal of Hemoglobin Solution as a Blood Substitute", Surgery. Gynecology and Obstetrics. 
149:417-436(1979). 

Second, as expected, such solutions are "left-shifted" (lower P 50 ) as compared to whole blood. Gould, et al., "The 
Development of Polymerized Pyridoxylated Hemoglobin Solution as a Red Cell Substitute", Ann. Emerg. Med. 15: 1416- 
1419 (Dec. 3, 1986). 

45 Third, SFH has a half-life in the circulatory system of only about 2-4 hours. This is because oxy Hgb partially dis- 
sociates into a dimer (op) that is small enough to be filtered by the kidney. 

Finally, SFH has a high colloid osmotic pressure (COD). Thus, administration of SFH in a dose that would have the 
same oxygen-carrying capacity as a unit of packed red blood cells is inadvisable, since the high osmotic pressure (60mm 
Hg) would cause a massive influx of water from the cells into the bloodstream, thus dehydrating the patient's tissues. 

so This consideration limits the dose of SFH to that which provide a final concentration of about 6-8 gm Hgb/dl. 

In an effort to restore the desired P 50 , researchers added 2,3-DPG to the hemoglobin solution. Unfortunately, 2,3- 
DPG was rapidly eliminated from the circulation. Scientists then turned toother organic phosphates, particularly pyridoxal 
phosphate. Like 2,3-DPG, these compounds stabilized the T state" of the Hgb by forming a salt bridge between the N- 
termini of the two beta chains. The pyridoxylated hemoglobin had a P 50 of 20-22 torr, as compared to 10 torr for SFH 

55 and 28 torr for whole blood. While this is an improvement over SFH. the pyridoxylated Hgb remains "high affinity" relative 
to whole blood. 


3 


BP 0 700 997 A1 

Chemical Crosslinking of Hemoglobin Subunits 

The properties of hemoglobin have been altered by specifically chemically crosslinking the alpha chains between 
the Lys99 of alphal and the Lys99 of alpha2. Walder, U.S. 4,600,531 and 4,598,064; Snyder, et al., PNAS (USA) 84: 
5 7280-84 (1987); Chaterjee, et al., J. Biol. Chern., 261 : 9927-37 (1986). The P 50 was 29 mm Hg, and renal excretion was 
abrogated by the crosslinking, but the plasma half-life was increased just 2-3 fold. 

This chemical crosslinking was accomplished by reacting bis(3,5-dibromosalicyl) fumarate with deoxy hemoglobin 
A in the presence of inositol hexaphosphate. This reaction has a low yield (10-20%). Moreover, purification is required 
to eliminate derivatives modified at other sites (there are 42 other lysine residues and the amino terminal amino groups 
io of the four chains at which competing reactions could occur). 

A further problem with the use of a "diaspirin" crosslinking agent is that it can participate in a side reaction yielding 
a carcinogenic halophenol. 

In the hemoglobin analogue of the present invention, the N-terminal valine and C-terminal arginine of the alpha 
globins are connected by means of an amino acid or peptide linker, without resort of special coupling agents. 
75 The beta chains have also been chemically crosslinked. Kavanaugh, et al., Biochemistry. 27: 1804-8(1988). 
Kavanaugh notes that the beta N-termini are 16 A apart in the T state and 20 A apart in the R state. Not surprisingly, 
the introduction of a DIDS bridge between the N-termini of T state hemoglobin hindered the shift to the R state, thereby 
decreasing the 0 2 affinity of the molecule. While the Kavanaugh analogue has desirable oxygen binding and renal 
clearance characteristics, it too is obtained in low yield. 

20 

D. Gene Expression, Generally 

Gene expression embraces the transcription of DNA into messenger RNA, and the translation of messenger RNA 
into protein. The process of transcription begins when an enzyme, DNA-directed RNA polymerase, binds to DNA. The 

25 binding site for this enzyme is often called the "promoter," and the binding of the enzyme to the promoter may be controlled 
by various repressors or inducers of transcription. The RNA polymerase slides along the DNA molecule, manufacturing 
a messenger RNA transcript. When it encounters a second regulatory element, the "terminator," the enzyme falls off, 
and the mRNA transcript is formed. 

Messenger RNA is used by the ribosomes, the protein factories of the cell, as a template for the construction of the 

30 corresponding protein. The ribosomal binding site comprises the so-called Shine Delgarno (SD) sequence and a properly 
spaced initiation (start) codon. Beginning at a special RNA triplet known as the initiation codon, transfer RNAs bind to 
corresponding codons of the messenger. Each transfer RNA is two-handed; it binds to the messenger codon by means 
of a complementary anti-codon, while holding the corresponding amino acid in position to be linked into the growing 
polypeptide chain. The chain falls off when the ribosome encounters one of three special triplets known as "stop" codons. 

35 That part of the original gene which corresponds to the messenger sequence from the initiator codon to the last codon 
before the stop codon is known as the coding sequence. There is also a S'-flanking sequence, which begins with the 
promoter, and a 3 -f lanking sequence which ends with the terminator. 

Polycistronic Expression 

40 

It is possible for a single messenger RNA transcript to have one promoter, but two or more pairs of start and stop 
codons that define distinctly translatable sequences. Each such sequence is known as a "cistron," and the polypeptide 
corresponding to the cistrons are thus co-expressed under the control of the single promoter. 

The majority of bacterial operons are polycistronic, that is, several different genes are transcribed as a single mes- 
45 sage from their operons. Examples include the lactose operon with three linked genes flacZ. lacY and lacA) and the 
tryptophan operon with five associated genes {IrgE. irp_D, trp.C, trgB, and trpA). In these operons, the synthesis of mes- 
senger RNA is initiated at the promoter and, within the transcript, coding regions are separated by intercistronic regions 
of various lengths. (An operon is a cluster of genes that is controlled as a single transcriptional genetic unit). Translational 
■ efficiency varies from cistron to cistron. Kastelein, et al., Gene, 23: 245-54 (1983). 
so When intercistronic regions are longer than the span of the ribosome (about 35 bases), dissociation at the stop 
codon of one cistron is followed by independent initiation at the next cistron. With shorter intercistronic regions, or with 
overlapping cistrons, the 30S subunit of a terminating ribosome may fail to dissociate from the polycistronic mRNA, 
being instantly attracted to the next translational initiation site. Lewin. Gene Expression. 143-148. 

_ . . Unlike bacterial mRNAs. eukaroyotic mRNAs are generally monocistronic in nature. Lewin, Gene Expression. 157. — 

55 Synthetic polycistronic operons have been constructed and expressed in both prokaryotes and eukaryotes. 

Lee, et al., Nucleic Acids Res., 12: 6797 (1984) describe a special case of a synthetic polycistronic operon in which 
all of the cistrons express the same polypeptide. This hbmopolycistronic structure was constructed to maximize the gene 
dosage effect. 
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Schoner, et al., PNAS, 83: 8506-10 (1986) translated a synthetic two-cistron mRNA in E coN. The first cistron was 
a short, arbitrary AU-rich sequence, while the second cistron was a mammalian gene (bGH). It was found that "read 
through" translation occurred if the stop codon of the first cistron followed the SD element of the second cistron and lay 
close to the start codon of the second cistron. Schoner's purpose was to overcome his failure to express Met-bGH with 
5 its native codons at high levels, possibly as a result of inhibition of translation by local secondary structures. The first 
cistron was engineered to favor ribosome binding (by placing the SD sequence and the AUG initiation codon in an AU- 
rich region free of local secondary structure). See also Schoner, et al.. Meth. Enzymol., 153: 401-416 (1987). which 
reveals that bGH overproduction by this technique was associated with the formation of protein granules. 

Saito, et al.. J. Biochem., 101 : 1281-88 (1987) expressed a synthetic somatomedin C gene in E coh using a two 
io cistron system. They theorized that the instability of somatomedin C, a basic polypeptide, might -be overcome by com- 
plexing it with an acidic polypeptide. Thus, they constructed a two-cistron system which could express both polypeptides. 
The termination codon of the first cistron overlapped the initiation codon of the second cistron. The transformants accu- 
mulated Somatomedin C at high levels. However, the somatomedin C was recovered in the form of insoluble pellets (see 
page 1282). 

is The ribosomes of mammalian cells are likewise capable of reinitiating translation at an initiation codon downstream 
from a termination codon. Thus, Boel, et al., FEBS Lett., 219:181 (1987) expressed a dicistronic transcription unit in 
mammalian (CHO) cells. This unit directed synthesis of both the precursor of human pancreatic polypeptide and of a 
selectable genetic marker (mouse DHFR). 

CODON, WO88/05486 describes the production of dicistronic mRNA which encodes both a protein of interest (e.g., 

20 tissue plasminogen activator) and a selectable phenotype (e.g., neomycinresistance). The common promoter was, in 
each of the examples a derivative of the Harvey murine sarcoma virus, and the dicistronic mRNA was translated in 
suitable eukaryotic cells. 

GENENTECH, EP Appl 1 1 7,058 discloses the expression in vertebrate host cells of a dicistronic expression vector 
wherein one cistron codes for the desired protein (e.g., HbsAg) and a second codes for a second protein (e.g., DHFR) 
25 whose synthesis is subject to environmental control (e.g., with methotrexate). 

F. Fused Genes and Proteins, Generally 

Genes may be fused together by removing the stop codon of the first gene, and joining it in phase to the second 

30 gene. Parts of genes may also be fused, and spacer DNAs which maintain phase may be interposed between the fused 
sequences. The product of a fused gene is a single polypeptide, not a plurality of polypeptides as is expressed by a 
polycistronic operon. Different genes have been fused together for a variety of purposes. Thus, Gilbert, U.S. 4,338,397 
inserted a rat preproinsulin gene behind a fragment of the E. coh penicillinase gene. His purpose was to direct EL coH 
transformants to secrete the expression product of the fused gene. Fused genes have also been prepared so that a 

35 non-antigenic polypeptide may be expressed already conjugated to an immunogenic carrier protein. The present inven- 
tion, however, contemplates the joining of two copies of the same gene. 

The use of linker DNA sequences to join two different DNA sequences is known. These linkers are used to provide 
restriction sites for DNA cleavage, or to encode peptides having a unique character that facilitates purification of the 
encoded fusion protein or a fragment thereof. See, e.g., Rutter, U.S 4,769,326. 

40 The lectin of Pisgm sativum seeds is synthesized as a single 275-amino acid preproprotein consisting of a signal 
sequence followed first by the beta chain and then by the alpha chain. The signal sequence is removed in the endoplasmic 
reticulum, and in the protein bodies the resulting "prolectin" is cleaved into a 187-AA beta chain and a 58-AA alpha 
chain. (Further processing results in truncation at the carboxyl termini). While the pea seed isolate is thus a heterodimer, 
it was discovered that the uncleaved naturally-occurring "prolectin" also binds carbohydrates, and that this "prolectin" 

45 could be expressed in EL coN and recovered in functional form. Stubbs, et al., J. Biol. Chem., 26: 61 41 -44 (1 986). 

Toth, U.S. 4,774,1 80 teaches the expression of polyprotein. This polyprotein was made from a fused DNA sequence 
encoding both a first polypeptide which catalyzes the reaction of glycine with ATP to form glycyl-adenylate and a second 
polypeptide which reacts glycyl adenylate with tRNA GLY to obtain the glycine-charged tRNA. These two polypeptides 
■ are the alpha and beta subunits of glycine tRNA synthetase which has an a$ 2 quaternary structure. The two subunits, 

so in the E, coh genome, are encoded by a single dicistronic gene. Toth linked the coding region of the alpha chain to the 
coding region of the beta chain by means of a linker encoding six amino acids. See also Toth and Schimmel, J. Biol. 
Chem., 261: 6643-46 (May 1986). 

Ladner, U.S. 4,704,692 describes an expert system for finding linkers which may be used to convert two naturally 
. aggregated but chemically separated polypeptide chains into a single polypeptide chain with a similar conformation after 

55 folding. This system relies on a database containing amino acid sequences for which 3-D structures are known. The 
database is examined for candidate amino acid sequences with a span similar in length to the interchain gap to be 
bridged. The direction and orientation of the candidate peptides in then checked. The algorithm assumes that these 
peptides will maintain the same length and orientation regardless of the flanking sequences. 
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Ladner. WO88/06601 presents a hypothetical approach to the preparation of "pseudodimeric" analogues of dimeric 
repressor proteins In essence, an amino acid linker is introduced to convert the dimeric molecule into a single chain. 
According to Ladner. this linker may be designed directly by the method of the '692 patent; alternatively, the linker- 
encoding DN A is a random oligonucleotide and in vivo selection is used to find a pseudodimer whose linker permits the 
5 molecule to foW cocrecti/ and t>nd sequence-specifically to DNA. 

Hallewell. et al . J Boi Cnem . 264: 5260-68 (1989) prepared an analogue of CuZn superoxide dismutase. Each 
dismutase molecule * a dimef of two identical subunits; a copper ion and a zinc ion are liganded to the subunit. The 
dimer interaction m CuZn super cade dismutase is so strong that the subunits have not been separated without inacti- 
vating the enzyme Tne eruyme has considerable conformational similarity to immunoglobulins; Hallewell, et al., joined 

io two human supe*oi«le dismutase genes, either directly or with DNA encoding a 19-residue human immunologlobulin 
IgAl hinge region and expressed the fused genes in a transformed host. In attempting to express the directly joined 
genes, recomt«nat>on occurred to eliminate one of the tandem genes in come plasmid molecules. Hallewell, et al., 
postulated that tr* cVect connection distorted the dimer, causing the exposure of hydrophobic areas which then had a 
toxic effect. Th s woud have provided selection pressure favoring gene deletion. No recombination was detected with 

is the lgA1 linker tx*n»t(uc*on 

Unfortunate** it c*trvx>t t*? at>b*jm«j that a pseudodimeric fusion protein containing a peptide linker will fold properly 
so to be a functional equivalent o* its parental heterodimer. 

G. Expression of Soiubio Proteins 

20 

Efforts to produce heterologous proteins in transformed cells sometimes result in the precipitation of some or all of 
the protein as insoluble inclusion bock es. also known as retractile bodies. See, e.g., Paul, etal., Eur. J. Cell Biol., 31 : 171- 
174 (1983) (human promsulirtfL CQL» trpE fusion protein); Denefle, et a!., Gene, 56:61-70 (1987) (angogenin); Langley, 
etal, Eur. J. Biochem , 163 313-321 (1987) (bovine growth hormone); Petrov, etal.. Biology of the Cell. 61:1-4 (1987) 

25 (calcitonin); Richardson et al . Biochtm Biophys. Acta, 950:385-94 (1988) (ricin B chain); Davis, et al.. Biochemistry, 
26:1322-26 (1987) (tumor neaos* factor); Lee, etal., Biochim. Biophys. Res. Commun., 151:598-607 (1988) (gamma 
interferon); Meng. et at . J Chromatogr . 443:183-92 (1988) (Somatomedin C); Tsuji, et al., Biochemistry. 26:3129-34 
(1987) (interleukin-2) The term "retractile" refers to the ability to observe these bodies by phase contrast microscopy. 
Frequently, this insoluble protem retams only a fraction of the expected biological activity, possibly due to incorrect folding. 

30 It has been suggested that inclusion bodies are formed when molecules of partially folded proteins interact with each 
other faster than they can fold into their native, active conformation. Kruger, et aL, Biopharm, 40 (March 1989); Haase- 
Pettingell and King, J. Biol Chem., 263: 4977-83 (1988). "Factors contributing to the formation of inclusion bodies in 
recombinant bacteria remain obscure and it is not easy to predict the physical state of the product of a newly expressed 
eukaryotic gene in E coli " Petrov, supra . 

35 While the formation of these inclusion bodies results in enrichment of the recombinant protein, and is therefore 
sometimes desirable, it also necessitates solubilization of the aggregates and regeneration of the protein's biological 
activity. Petrov, supra at 4, comments, "sometimes these obstacles seem to be the most critical point of the recombinant 
technology." 

Attempts have been made to solubilize and renature these proteins. Wetzel, U.S. 4,599, 1 97; Builder, U.S. 4,620,948; 

40 Olson, US. 4,51 1,503; Jones, US. 4,512,922. However, such efforts can be laborious and uncertain of success. See 
Giantini and Shatkin, Gene, 56: 153-160 (1987). As stated by Weir and Sparks, Biochem. J., 245: 85-91 (1987), "proteins 
vary considerably in their optimal conditions for renaturation; various factors such as pH, salt concentration and type, 
rate of removal of denaturant, concentration of the target protein and of contaminants may strongly affect the recovery 
of authentic protein." These complications are avoided if the protein of interest is expressed in soluble form. 

45 Gatenby, etal., Eur. J. Biochem., 168: 227-31 (1987) has discussed difficulties in the preparation of the higher plant 
enzyme ribulose-bisphosphate carboxylase. This enzyme has the subunit structure L^Ss. where Lisa large subunit and 
is a small subunit. In nature, a binding protein apparently maintains L in soluble form prior to assembly with S. Attempts 
to assemble an active higher plant RuBPCase in E, coji have been frustrated by the formation of an insoluble, inactive 
: aggregate of L. 

50 

H. Bacterial Expression of Human Alpha and beta Globins 

Nagai and Thorgerson (Nature, 309: 810-812, 1984) expressed in E. coli a hybrid protein consisting ofthe31 amino- 

_ terminal residues of the lambda ell protein, an lle-Glu-Gly-Arg linker, and the complete human beta globin chain. They 

55 cleaved the hybrid immediately after the linker with blood coagulation factor Xa. thus liberating the beta-globin chain. 
Later, Nagai, et al., P.N.A.S. (U.S.A.), 82:7252-55 (1985) took the recombinant DNA-derived human beta globin, naturally 
derived human alpha globin, and a source of heme and succeeded in producing active human hemoglobin. Because 
the alpha globin was derived from erythrocytes, the final product may contain undesirable erythrocyte membrane con- 
stituents. 


6 

BNSDOCID: <EP 070O997A1 I > 


It! 4 

EP 0 700 997 A1 

More recently, an efficient bacterial expression system for human alpha globin was reported. GB 8711614, filed 
May 16, 1987; see copending Ser. No, 07/194,338 and WO 88/09179. This led to the production of wholly synthetic 
human hemoglobin by separate expression of the insoluble globin subunits in separate bacterial cell lines, and in situ 
refolding of the chains in the presence of oxidized heme cofactor to obtain tetrameric hemoglobin. This procedure is 

5 laborious and low in yield. It requires the use of denaturing solvents (urea and guanidine), and chemical reduction of 
ferric ton to the ferrous state (see example). One object of the present invention is to overcome these disadvantages. 

While human alpha and beta globins may be expressed separately in E. coli . Walder, Proceedings, Biotech USA 
1988 (San Franciso, Nov. 14-16, 1988) warns at page 360, "isolated alpha and beta [globin] chains are unstable and 
tend to precipitate." If human alpha and beta globin are not produced in soluble form, they must be solubilized with 

io denaturing agents and then refolded to restore activity. Moreover, when a wild-type alpha globin gene is expressed in 
IL £0!L alpha globin accumulates only slowly. It is not certain whether this is due to inefficient translation or to the action 
of host proteases, but WO 88/091 79 teaches that this problem may be overcome by fusing a short section of the beta 
globin gene to the alpha globin gene, so that a hybrid protein is produced. This hybrid protein must then be cleaved, 
e.g., with a protease, to release the globin. If the protease is not completely selective (perhaps because of contamination 

15 by other proteases), the desired cleavage product may not be the only one obtained. In any event, that product must be 
separated from other goH polypeptides, and any contaminants associated with the protease. 

Sperm whale myoglobin has been expressed in con, demonstrating that bacteria can incorporate prosthetic heme 
groups into a protein expressed from a cloned eukaryotic gene. Springer and Sligar, PN AS (USA) 84: 8961 -65 (1 987). 
Walder says, "it remains to be seen if hemoglobin can be similarly made if both the alpha and beta chains are expressed 

20 within the same cell." While there is a high degree of tertiary structure similarity between myoglobin (a single chain 
protein) and the individual alpha and beta globin subunits, hemoglobin is a heterotetrameric protein, the primary globin 
sequences have no more than a 27% homology and myoglobin is now known to enjoy signif icantly higher stability than 
either alpha or beta globin. Thus, it could not be predicted that co-expression of alpha- and beta-globin in the same cell 
would result in intracellular assembly of a functional hemoglobin, which requires proper folding of the alpha and beta 

25 chains and incorporation of heme. 

I. Human Gene Expression in Yeast, Generally 

A number of human proteins have been expressed in transformed yeast cells, especially Saccharomvcescerevisiae. 

30 either cytoplasmically or by secretion into the culture medium. King, etal., Biochem. Soc. Transac, 16:1083-1086(1988). 
But success is not guaranteed. Thim, et al., FEBS Lett., 212:307-312 (1987) experienced difficulty in obtaining properly 
crosslinked insulin from yeast cells in which the intact proinsulin-encoding gene had been inserted. They overcame this 
problem by constructing a modified proinsulin gene which encoded the B and A chains linked by a hexapeptide spacer. 
The product of this gene was cleaved and the two chains were properly folded and crosslinked by the cells. 

35 Richardson, et al., Biochim. Biophys. Acta, 950:385-94 (1988) expressed the B chain of the heterodimeric protein 
ricin in E. coli . They reported that it was hard to obtain high levels of secretion of a yeast alpha factor leader/ricin B chain 
fusion protein. No attempt was made to co-express and assemble the ricin A and B chains. 

Murakami, et al., DNA, 6:189-97 (1987) reported production of a heme-containing fused enzyme in transformed 
yeast cells. 

40 Horwitz, et al., PNAS (USA), 85:8678-82 (Nov. 1988) described the construction of yeast strains which secrete 
functional mouse variable region/human lgG1 constant region chimeric antibodies into the culture medium. They char- 
acterize their paper the first report of the secretion of a foreign multimeric or heterodimeric protein in yeast. But see also 
Carlson, Mol. Cell. Biol., 8:2638-46 (June 1988)., showing transcription and translation of heavy and light-chain cDNAs 
into polypeptides which associate and bind antigen. 

45 Beggs, et al., Nature, 283:835 (1980) attempted to express a chromosomal rabbit beta globin gene in S. cerevisiae . 

However, these yeast cells were unable to properly splice the intron-containing globin mRNA transcript. 

No admission is made that any reference cited herein is prior art The description of the work and the citation of 
publication date are based solely on the published information and the applicants reserve the right to question the 
accuracy of that information. 

so 

SUMMARY OF THE INVENTION 

It is the object of this invention to overcome the aforementioned deficiencies of the prior art. For example, Applicants 
have achieved the first complete expression and assembly of tetrameric hemoglobin in cells which do not produce hemo- 
55 globin in nature. Prior work has related to the separate expression of alpha and beta globin and their extracellular com- 
bination with heme to form hemoglobin. 

A central feature of the present invention is the intracellular assembly of human alpha- and beta-globin-like polypep- 
tides and intracellular incorporation of heme to form a biologically functional hemoglobin-like protein. This intracellular 
assembly is achieved by expressing the alpha and beta-globin-like polypeptides in the same cell so that fold together 
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and incorporate heme. An important characteristic of this invention is a substantial reduction the formation of insoluble 
globin inaggregates, in particular of beta globin, as compared to what is observed when globins are separately expressed 
in E, coM or S. cerevisiae . Co-expression may be achieved from genes on two separate but compatible plasmids in the 
same cell, or from two different operons on the same plasmid, or from a single polycistronic operon. 

5 In one embodiment, the human alpha- and beta-globin-like polypeptides are co-expressed in bacterial cells. The 

corresponding genes may be included in the same synthetic operon (i.e., driven by one promoter), or placed in separate 
operons with separate promoters (which may be the same or different). Preferably, expression of the alpha- and beta- 
globin is enhanced by placing a "ribosomal loader" sequence as hereafter described before each globin gene. This is 
particularly advantageous in the case of alpha globin which is more difficult to produce in quantity. 

io in another embodiment, the human alpha- and beta-globin-like polypeptides are co-expressed in yeast cells. 
Improvements in both the yield of the alpha globin and the solubility of beta globin are obtained. 

A further aspect of the invention is the production of novel intermediates, human di-alpha globin and human di-beta 
globin (and mutants thereof), which can be expressed in a cell and assembled with each other or with beta- or alpha- 
globin-like polypeptides, respectively, into a human hemoglobin-like protein. While intracellular assembly is not strictly 

15 required, di-alpha and di-beta globin may be considered specially adapted to intracellular assembly of a functional hemo- 
globin since expression of, e.g., a di-alpha globin is analogous in some respects to intracellular assembly of two alpha 
globin sub-units, differing from assembly as previously discussed in that the association is accomplished by expression 
of a covalent peptide linker rather than by noncovalent interaction of the subunits. Di-alpha and Di-beta-globin-iike 
polypeptides may be expressed in, preferably, bacterial cells or in yeast cells. 

20 These facets of the invention are now discussed in greater detail. 

Yeast Expression of Hemoglobin-Like Proteins 

Applicant have discovered that it is possible to produce human hemoglobin (or mutants thereof) in yeast, especially 
25 Saccharomvces cerevisiae . The use of a yeast expression system obviates the need to separate the hemoglobin from 
bacterial endotoxins. We have also found that alpha and beta globins with the correct N-terminal amino acid may be 
obtained without first expressing the globin as a part of selectively cleavable fusion protein. We believe that this is 
because the yeast enzyme methionyl aminopeptidase should be capable of removing the N-terminal methionine from 
Met-alpha-globin and Met-beta-globin to expose the desired N-terminal amino acid (Valine). Production of altered oxygen 
30 affinity mutants as discussed in W088/09179 is of special interest. Such mutants may be produced by site-specific 
mutagenesis of globin genes followed by cfoning and expression in yeast. 

In a preferred embodiment, expression is controlled by a "gal-gap49" hybrid promoter as hereafter defined. 

Co-Expression of Alpha and Beta Globin Genes in Yeast Cells 

35 

In a preferred embodiment, the alpha and beta globin genes are both expressed within the same yeast cell. Expres- 
sion of the beta globin gene alone results in the production of beta globin as a largely insoluble, diff icult-to-extract protein 
comprising less than 2% of the total cell protein. Expression of the alpha globin gene alone results in production of alpha 
globin at very low levels (under 0.5% of the total cell protein). In neither case is heme incorporated. When, however, the 
40 alpha and beta globin genes are co-expressed, the transformed yeast cells fold the alpha and beta globin chains together 
and incorporate heme groups to obtain functional recombinant human hemoglobin in soluble form, accumulating to 
about 10% of the total cell protein, without any change in the promoters operably linked to the genes. 

The alpha and beta globin genes may, in turn, be carried on different plasmids or on the same plasmid within the 
host cell. 

45 

Polycistronic Co-Expre ssion of the Alpha and Beta Globin Genes in Bacterial Cells. 

Applicants have translationally coupled alpha and beta globin genes to a small "ribosomal loader" gene encoding 
a small consumable peptide that will lead the ribosome directly into the ATG of the desired alpha and beta globin message 

so and thus enhance translational efficiency. The have also placed the alpha and beta globin genes in the same operon 
so they are transcribed into a single polycistronic mRN A transcript. The globins are then translated as separate polypep- 
tide chains which subsequently are folded together and joined with intracellular heme by transformed cells to form the 
hemoglobin tetramer. Applicant's method overcomes the problem associated with separate purification of precipitated 
alpha and beta globins. . - 

55 The polycistronic expression and assembly of a heterooligomeric human protein in soluble, active form in a heter- 
ologous host has not been previously reported. It is especially noteworthy that this was a mammalian protein expressed 
in a prokaryotic (bacterial) host. It should further be considered that this protein incorporates prosthetic groups, which 
add a further complication to the goal of proper post-translational processing. 
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In one embodiment, Met-FX-alpha globin and Met-FX-beta globin are co-expressed, where FX denotes a leader 
peptide which a recognition site for Factor Xa cleavage activity. FX-alpha globin and FX-beta globin assemble to form 
a mutant hemoglobin with reversible oxygen binding activity, albeit higher in affinity for oxygen than native hemoglobin. 
Alternatively, the FX leader, or other fused leader, may be cleaved to obtain a duplicate of native Hgb. 

5 In another embodiment, Met-aipha globin and Met-beta globin are co-expressed. This eliminates the need for a 

cleavage step. 

In a third embodiment, des-val-alpha globin and des-val beta globin are co-expressed. Native alpha and beta globin 
both begin with valine. The valine may, however, be replaced with methionine, which is of similar hydrophobicity. 

In further embodiments, one or more codons of the native genes are altered so that a alpha and/or beta globin- 
10 related protein characterized by one or more amino acid differences (insertions, deletions or substitutions) from the 
native species is produced. 

Pi-Alpha and Pi-Beta Globins 

is A new protein, di-alpha globin, has been prepared, which consists of two alpha globin amino acid sequences cov- 
alently connected by peptide bonds, preferably through an intermediate linker of one or more amino acids. Surprisingly, 
these "genetically fused" alpha globin chains were capable of appropriately folding and combining with beta globin and 
heme to produce functional hemoglobin analogue. The term "genetically fused" refers to the method of production. Two 
copies of the globin gene are fused together, preferably with a spacer DNA encoding the amino acid linker, so the 

20 construct directly encodes the desired di-alpha globin. The term "analogue" is used because in native hemoglobin, the 
alphal and alpha2 subunits are noncovalently bound. The analogous preparation of di-beta globin is also envisioned. 

The preparation of "genetically fused" hemoglobins avoids the disadvantages of chemical crosslinkng. The latter 
is inefficient and often requires deoxygenation of the hemoglobin solution and the presence of another molecule (e.g., 
inositol hexaphosphate or 2,3-DPG) to prevent competing reactions. 

25 In a preferred embodiment, the di-alpha globin and/or the beta globin contain mutations which reduce the oxygen- 

binding affinity of the hemoglobin analogue in solution so as to approach the oxygen-binding characteristics of whole 
blood. 

The di-alpha hemoglobin advantageously exhibits a substantially longer half-life in the circulatory system than does 
conventional (des-val) recombinant hemoglobin. Preferably, in humans, the half-life exceeds 9 hours at a dose of at least 
30 1 gm/kgm body weight. This would be expected to correspond to a half-life of about 3 hours in rats given a comparable 
dose. 

The di-alpha and di-beta globins can be expressed in both bacteria and yeast. 
BRIEF DESCRIPTION OF THE DRAWINGS 

35 

Flowchart for construction of plasmids for expression of FX-alpha globin (pDL ll-62m), FX-beta globin 
(pDL 11-1 0a), and FX-hemoglobin (pDL ll-66a) are schematized. 

Flowcharts for construction of plasmid pDL I II- 1a (2a) bearing dicistronic Des-Val-Alpha globin gene under 
control of Tac promoter, and polycistronic di-alpha/beta co-expression plasmid pDL lll-47a (2b). 

Flowchart for construction of plasmid for co-expression of Met-alpha and Met-beta globins, pDL lll-13e. 

Oligonucleotides for construction of synthetic FX-alpha and FX-beta globin genes. The top strand is shown 
5' to 3' and the bottom strand as 3* to 5*. Areas of overlap between complementary synthetic oligonucle- 
otides are shown as areas where both strands are shown in the same case letters. The Pstl site that joins 
FX-alpha and FX-beta occurs at the overlap of SJH l-35a and SJH l-36b. 

Synthetic gene for expression of Met-FX-alpha and Met-FX-beta globin. Region A contains the alpha 
globin gene and region B the beta globin gene. The location of the Factor X sequence and the two Shine- 
Delgarno sequences (SD#1 and SD#2) in both regions is indicated. Selected restriction sites are also 
found. The translated amino acid sequences for the ribosomal loader and Met-FX-alpha/and beta-globin 
are given. 

Eiution profile and absorbance spectrum for FX-hemoglobin. 

Oligonucleotides for construction of mutant hemoglobins (differing in amino acid sequence from conven- 
tional hemoglobin). 


Figure 1 : 
Figure 2 : 

40 

Figure 3 : 
Figure 4 : 

45 

Figure 5 : 

so 

55 Figure 6 : 
Figure 7 : 
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Oligonucleotides for construction of plasmids which do not encode the Factor Xa substrate recognition 
site. 


15 


20 


25 


30 


40 


45 


50 


55 


Figure 10 : 
Figure 1 1 : 
Figure 12 


Figure 13 : 

Figure 14 : 
Figure 15 : 
Figure 16 : 
Figure 1 7 : 
Figure 18 : 
Figure 19 : 

35 Figure 20 : 


Figure 21 : 
Figure 22 : 

Figure 23: 


Figure 24 : 
Figure 25 : 


Oxygen Binding of Des-Fx Hgb 
Plasmids pDL lll-14c and pDL lll-38b 

Shows the sequence of a preferred synthetic gene for expression of (des-Val) -alpha- (GlyGly)-alpha and 
des-Val beta globin. A shows the region (EcoRI to Patl) containing Shine-Delgarno ribosomal binding 
sites (SD#1 and SD#2), the sequence expressing the octapeptide (Met...Glu) which serves as a cotrans- 
lational coupler, and the sequence encoding the two nearly identical alpha globin-like polypeptides and 
the interposed Gly-Gly linker. The first alpha globin sequence begins "Met-Leu", that is, it contains an 
artifactual methionine, omits the valine which is the normal first residue of mature alpha globin, and con- 
tinues with the second residue, leucine. The second alpha globin sequence begins "Val-Leu", immediately 
after the underlined "Gly-Gly" linker. Start and stop codons are underlined. B shows the analogous region 
(Pstl to Hindi II) containing the coding sequence for des-Val beta globin. A and B are connected at the 
Pstl site to form a single polycistronic operon. 

Shows the structure of the final expression vector pDL lll-47a. "PTac" is the Tac promoter, and , *ampicillin ,, 
is the ampicillin resistance gene. Figure 13a shows an Xbal-BamHI fragment of PDL ill-47a. 

Plasmid pSGE0.0E4 
Plasmid pSGE1.1E4 
Plasmid pDL IV-67a 
Plasmid pJR Vl-54a 
Plasmid pDL di-alpha/beta/beta 

Flowchart showing the construction off various expression vectors featuring lambda P L regulation of var- 
ious polycistronic globin operons. 

Nucleotide Sequence of GAL-GAP promoter, with restriction sites indicated. The region from Sphl to 
EcoRV contains a synthetic GAL^o regulatory region (M. Johnston and R. Davis. 1984. Molecular and 
Cellular Biology, 4:1440-1448). The UAS is in the region numbered 29-63 on this Figure. The region from 
EcoRV to the Xbal site contains the consensus GAP491 transcriptional start region, with the approximate 
start of transcription being at395. (L McAlister and M.J. Holland. 1983. J. Biol. Chem., 260:15019-15027; 
J. Holland, et al. 1983. J. Biol. Chem., 25^:5291-5299.) 

Flowcharts showing construction of beta-globin expression cassette (21a and 21b). 

Flowcharts showing construction of beta-globin expression vector pGS4988 (22a) and alpha-globin 
expression vector pGS4688 (22b). 

Flowcharts (23a and 23b) showing construction of an alpha-globin expression cassette and of vectors 
pGS289 and pGS389 for co-expression of alpha- and beta globin from the same plasmid. Note that alpha 
and beta globin are expressed from separate promoters. 

Absorbtion spectra for yeast-produced recombinant and native human hemoglobin. 

Flowchart (25a) showing construction of di-alpha/beta hemoglobin yeast expression vector and map of 
plasmid pGS3089 (25b) - _ . 


Figure 26 : Map of plasmid pGS3089RGV desp. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The structure of conventional hemoglobin is well known. We herewith incorporate by reference the entire text of 
Bunn and Forget, eds., Hemoglobin: Molecular. Genetic and Clinical Aspects (W.B. Saunders Co., Philadelphia, PA: 
s 1986) and of Fermi and Perutz "Hemoglobin and Myoglobin," in Phillips and Richards, Atlas of Molecular Structures in 
Biology (Clarendon Press: 1981). 

The primary structure of a polypeptide is defined by its amino acid sequence and by identification of any modifications 
of the side chains of the individual amino acids. 

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2, because it comprises two 
io alpha and two beta chains). The alpha chain consists of 141 amino acids (See Figure 11). The iron atom of the heme 
(ferroprotoporphyrin IX) group is bound covalently to the imidazole of his 87 (the "proximal histidine"). The beta chain is 
146 residues long (see Figure 12) and heme is bound to it at his 92. 

Other recognized hemoglobin species are Hgb A 2 (a 2 a 2 62), Hgb A 1a , Hgb A 1b , and Hgb A 1c , as well as the rare 
species Hgb F (a 2 gamma^, Hgb Gower-1 (Zeta 2 epsilon^, Hgb Gower-2 (alpha 2 epsilon^, Hgb Portland (2eta 2 
15 gamma^, and Hgb H (beta4) and Hgb Bart (gamma^. They are distinguished from Hgb A by a different selection of 
polypeptide chains. 

Segments of polypeptide chains may be stabilized by folding into one of two common conformations, the alpha helix 
and the beta pleated sheet. In its native state, about 75% of the hemoglobin molecule is alpha-helical. Alpha-helical 
segments are separated by segments wherein the chain is less constrained. It is conventional to identify the alpha-helical 

20 segments of each chain by letters, e.g., the proximal histidine of the alpha chain is F8 (residue 8 of helix F). The non- 
helical segments are identified by letter pairs, indicating which helical segments they connect. Thus, nonhelical segment 
BC lies between helix B and helix C. In comparing two variants of a particular hemoglobin chain, it may be enlightening 
to attempt to align the helical segments when seeking to find structural homologies. For the amino acid sequence and 
helical residue notation for conventional human hemoglobin Aq alpha and beta chains, see Bunn and Forget, supra, and 

25 Table 1 herein. 

The tertiary structure of the hemoglobin molecule refers to the steric relationships of amino acid residues that are 
far apart in the linear sequence, while quaternary structure refers to the way in which the subunits (chains) are packed 
together. The tertiary and quaternary structure of the hemoglobin molecule have been discerned by X-ray diffraction 
analysis of hemoglobin crystals, which allows one to calculate the three-dimensional positions of the very atoms of the 
30 molecule. 

In its unoxygenated ("deoxy", or "T for "tense") form, the subunits of hemoglobin (alphal , alpha2, betal , and beta2) 
form a tetrahedron having a twofold axis of symmetry. The axis runs down a water-filled "central cavity". The subunits 
interact with one another by means of Van der Waals forces, hydrogen bonds and by ionic interactions (or "salt bridges"). 
The alphal betal and alpha2beta2 interfaces remain relatively fixed during oxygenation. In contrast, there is considerable 
35 flux at the alphal beta2 (and alpha2 betal ) interface. In its oxygenated ("oxy", or "R" for "relaxed" form), the intersubunit 
distances are increased. 

The tertiary and quaternary structures of native oxyhemoglobin and deoxyhemoglobin are sufficiently well known 
that almost all of the nonhydrogen atoms can be positioned with an accuracy of 0.5 A° or better. For human deoxyhe- 
moglobin, see Fermi, et al., J. Mol. Biol., 175: 159 (1984), and for human oxyhemoglobin, see Shaanan, J. Mol. Biol., 
40 171:31 (1 983), both incorporated by reference. 

While analyses of hemoglobin structure tend to focus on the alpha-beta interfaces, it is known that the distance 
between the amino terminus of one alpha subunit and the carboxyl terminus of the other is about 5.6 A° in the deoxy 
configuration and 3.3 A° in the oxy configuration. 

For the purpose of the appended claims, a hemoglobin-like protein is an oxygen binding protein with a plurality of 
45 heme prosthetic groups and a P50 of at least about 6 torr (when measured as for Table 1 of Ser. No. 07/194,338), and 
composed of a plurality of polypeptides each of which is a human alpha (or di-alpha) globin-like or human beta (or di- 
beta) globin-like polypeptide. A human alpha globin-like polypeptide is native human alpha globin or a mutant thereof 
differing from the native sequence by one or more substitutions, deletions or insertions, while remaining at least 75% 
^ homologous with human alpha globin, and still capable of incorporating heme and associating with beta globin. A beta 
50 globin-like subunit is analogously defined. Subunits of animal hemoglobins or mutants thereof which are sufficiently 
homologous with human alpha or beta globin are embraced by the term "human alpha or beta globin-like polypeptide." 
For example, the subunits of bovine hemoglobin are within the scope of these terms. The alpha- and beta-globin-like 
polypeptides may be referred to collectively as "globins". 

For the purpose of the appended claims, a di-alpha globin-like polypeptide is one which consists essentially of two 

55 alpha-globin-like polypeptide sequences connected by peptide bonds between the normal C-terminus of the first alpha- 
globin-like polypeptide and the normal N-terminus of the second alpha-globin-like polypeptide. These two sequences 
may be directly connected, or connected through a peptide linker of one or more amino acids. Alpha globin chains 
crosslinked at the N- and C-terminals other than by peptide bonds (e.g., by DIDS) are excluded. The di-alpha globin- 
like polypeptide must be capable of folding together with beta globin and incorporating heme to form functional hemo- 
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globin-fike protein. The di-beta globin-like polypeptide is analogously defined. The di-alpha and di-beta globin-like 
polypeptides may collectively be referred to as "pseudodimeric globin-like polypeptides". 

A "Met FX alpha globin" is an alpha globin-like polypeptide comprising an N-terminal methionine, a oligopeptide 
which acts as a recognition site for Factor Xa (e.g., Ile-Glu-Gly-Arg), and an alpha globin-like sequence (e.g., Val-His- 
Leu-Thr-Pro...) which may correspond to wild-type alpha globin or to a mutant thereof as taught herein. The term "Met 
FX alpha globin" is sometimes abbreviated as "FX alpha globin". "FX beta globin" is an analogously defined beta globin- 
like polypeptide. 

"Met-alpha globin" is an alpha globin-like polypeptide with an extra N-terminal methionine. The second amino acid 
is valine, which is the first amino acid of mature wild-type alpha globin. Met-beta globin is analogously defined. A "Des- 
FX alpha globin" gene (or "dFFX alpha globin") is a Met-alpha globin gene obtained by excising the FX codons from a 
Met-FX alpha globin gene. Note that "Met-Hgb" is used to refer to methionyl Hgb formed from methionyl-alpha globin 
and methionyl-beta globin. 

"Des-Val-alpha globin" (or "dVal alpha globin") is an alpha globin-like polypeptide wherein methionine is substituted 
for the valine which begins the sequence of mature wild-type alpha globin. Des-Val-beta globin is analogously defined. 
Des-Val-alpha/alpha globin (di-des Val alpha globin) is a "di-alpha globin" in which a "Des-Val-alpha" sequence is linked 
via an appropriate peptidyl linker to an alpha globin-like sequence which begins with Val. 

The alpha and beta globin-like chains need not correspond exactly in sequence to the alpha and beta globins of 
"conventional" hemoglobin. Rather, mutations may be introduced to alter the oxygen affinity or stability of the hemoglobin, 
or the ease of expression and assembly of the individual chains. By way of example and not limitation, several mutant 
hemoglobins have been prepared by the method of this invention. Guidance as to further mutations is provided by the 
copending application of Hoffman and Nagai, Ser. No. 07/194,338, incorporated by reference herein. 

The DNA sequences encoding the individual alpha (or di-alpha) and beta (or di-beta) globin chains may be of 
genomic, cDNA and synthetic origin, or a combination thereof. Since the genomic globin genes contains introns, genomic 
DNA must either be expressed in a host which can properly splice the premessenger RNA or modified by excising the 
introns. Use of an at least partially synthetic gene is preferable for several reasons. First, the codons encoding the desired 
amino acids may be selected with a view to providing unique or nearly unique restriction sites at convenient points in 
the sequence, thus facilitating rapid alteration of the sequence by cassette mutagenesis. Second, the codon selection 
may be made to optimize expression in a selected host. For codon preferences in E. coli . see Konigsberg, et al., PNAS, 
80:687-91 (1983). For codon preferences in yeast see the next section. Finally, secondary structures formed by the 
messenger RNA transcript may interfere with transcription or translation. If so, these secondary structures may be elim- 
inated by altering the codon selections. 

Of course, if a linker is used to genetically crosslink subunits, the linker will normally be encoded by a synthetic 
DNA. While the di-alpha globin and the beta globin may be expressed separately and then combined with each other 
and heme in vitro, they are preferably placed on one plasmid. 

The present invention is not limited to the use of any particular host cell, vector, or promoter. However, the preferred 
host cells are bacterial (especially, E. coli) and yeast (especially £L cerevisiae) cells. The promoter selected must be 
functional in the desired host cells. It preferably is an inducible promoter which, upon induction, provides a high rate of 
transcription. A preferred bacterial promoter is the Tac promoter, a trp/jac hybrid described fully in DeBoer, U.S. 4,551 ,433 
and commercially available from Pharmacia-LKB. Other promoters which might be used include the temperature sen- 
sitive lambda P L and P R promoters, as well as the lac, trp, trc, pIN (lipoprotein promoter and lac operator hybrid), gal 
and heat shock promoters. The promoter used need not be identical to any naturally-occurring promoter. Guidance for 
the design of promoters is provided by studies of promoter structure such as that of Harley and Reynolds, Nucleic Acids 
Res., 15:2343-61 (1987) and papers cited therein. The location of the promoter relative to the first structural gene may 
be optimized. See Roberts, et al., PNAS (USA), 76:760-4 (1 979). The use of a single promoter is favored. Suitable yeast 
expression systems are described in detail elsewhere in this specification. 

The vector used must be one having an origin of replication which is functional in the host cell. It desirably also has 
unique restriction sites for insertion of the globin genes and the desired regulatory elements and a conventional selectable 
marker. A vector may be modified to introduce or eliminate restriction sites to make it more suitable for futher manipu- 
lations. 

The alpha and beta globin chains may be expressed either directly or as part of fusion proteins. When expressed 
as fusion proteins, the latter may include a site at which they may be cleaved to release the alpha and beta globin free 
of extraneous polypeptide. If so, a site sensitive to the enzyme Factor Xa may be provided, as taught in Nagai and 
Thorgenson, EP Appl 1 61 ,937, incorporated by references herein. Alternatively, the alpha and beta fusion proteins may 
be synthesized, folded and heme incorporated to yield a hemoglobin analogue. 

The direct expression of the alpha and beta globin subunits is desirable. Factor Xa is a blood derivative. Preparations 
of Factor Xa may therefore contain undesirable blood-associated substances or etiologic agents. In any event, the hemo- 
globin must be separated from the Factor Xa. 

Nagai and Thorgerson, EP Appl 161 ,937, incorporated by reference herein, teach the construction of fused genes 
in which DNA coding for a polypeptide of interest is immediately preceded by DNA encoding a cleavage site for Factor 
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Xa, a serine protease. Certain of the peptide sequences to be cleaved by Factor Xa are quoted below (wherein the 

cleavage site is denoted by an *v): 

lle-Glu-Aly-Arg=Val-His-Leu-Thr Cll FxB-globin 

lle-Glu-Gly-Arg=Thr-Ala-Thr-Ser Hu prothrombin 
5 lle-Glu-Gly-Arg=Thr-Ser-Glu-Asp Bo prothrombin 

lle-Asp-Gly-Arg=lle-Val-Glu-Gly Hu prothrombin 

lle-Glu-Gly-Arg=lle-Val-Glu-G1y Bo prothrombin 

Ala-Glu-Gly-Arg=Asp-Asp-Leu-Tyr Hu antithrombin III 

In the above list, "CIIFXp-globin" refers to a hybrid fusion protein comprises the 31 amino-terminal residues of 
io the lambdaCII protein, the Factor Xa recognition sequence "lle-Glu-Gly-Arg," and the complete amino acid sequence 

of human beta globin (which begins *Val-His-Leu-Thr-..."). It will be evident from study of Figure 4 of the present invention 

that FX-alpha and FX-beta globins of Example 1 correspond to the native globin preceded by "Met-lle-Glu-Gly-Arg." 
In bacterial mRNA, the site at which the ribosome binds to the messenger is a polypurine stretch which lies 4-7 

bases upstream of the start (AUG) codon. The consensus sequence of this stretch is 5'... AGGAGG. ..3', and is frequently 
is referred to as the Shine-Dalgarno sequence. Shine and Dalgarno, Nature, 254: 34 (1 975). The exact distance between 

the SD sequence and the translation^ start codon, and the base sequence of this "spacer" region, affect the efficiency 

of translation and may be optimized empirically. Shepard, et al., DNA 1 : 1 25 (1985); DeBoer, et al., DNA 2: 231 (1 983) ; 

Hui, et al., EMBO J., 3: 623 (1984). 

In addition, the SD sequence may itself be modified to alter expression. Hui and DeBoer, PNAS (USA), 84:4762- 
20 66 (1987). Comparative studies of ribosomal binding sites, such as the study of Scherer, et al., Nucleic Acids Res., 

8:3895-3907 (1907), may provide guidance as to suitable base changes. If the hemoglobin is to be expressed in a host 

other than EL £Qli, a ribosomal-binding site preferred by that host should be provided. Zaghbil and Doi, J. Bacteriol., 

168:1033-35(1986). 

Any host may be used which recognizes the selected promoter and ribosomal binding site and which has the capa- 
25 bility of synthesizing and incorporating heme. Bacterial and yeast hosts are preferred. 

The intracellularly assembled hemoglobin may be recovered from the producing cells and purified by any art-rec- 
ognized technique. 

Polvcistronic Co-Expression of Alpha and Beta Globins and Their Assembly Into Hemoglobin 

30 

In one embodiment, expression of the alpha and beta globin genes is driven by a single promoter, and the genes 
are arranged so that a polycistronic messenger RNA transcript is transcrtoed, from which the separate alpha and beta 
globin polypeptides are subsequently translated. However, the present invention includes the co-expression of the alpha 
and beta globin genes from separate promoters, i.e., the host transcribes separate alpha and beta globin mRNAs. 

35 The use of a single promoter is favored on theoretical grounds. Ideally, alpha and beta globin are expressed in 
stoichiometrically equal amounts. While use of a single promoter does not guarantee equality, it eliminates one unbal- 
ancing influence - differences in transcription owing to differences in promoter strength and accessibility. If differences 
in promoter strength were minimized by use of two identical promoters on the same plasmid, plasmid stability would be 
reduced as there would be a propensity toward recombination of the homologous regions. We note, however, that in 

40 preliminary experiments we have co-expressed alpha and beta globins from separate promoters. 

Another justification for using a single promoter is to minimize the number of repressor binding sites. 
Preferably, the alpha and beta globin genes are arranged so that the ribosome will translate the alpha globin cistron 
first. The rationale is that there is some basis for believing that alpha globin affects the folding of beta globin. Nonetheless, 
the position of the genes may be switched so that beta globin is synthesized first, as is shown in Example 6. 

45 The stability of the polycistronic mRNA transcript, the efficacy of its translation into alpha and beta globin, and the 
folding of the globin chains into tetrameric hemoglobin may be modified by varying the length and base sequence of the 
intercistronic regions (the region lying between the stop codon of one cistron and the start codon of the next cistron), 
the phasing of a second cistron relative to a first cistron, and the position and sequence of the ribosomal binding site 
for the one cistron relative to the preceding cistron. 

so In a preferred embodiment, the alpha and beta globin genes are each preceded by a short "introductory" cistron or 

"ribosomal loader" which facilities the subsequent translation of the globin cistron. In Figure 4, region A contains two 
cistronsand a Shine-Delgarno sequence preceeding each cistron. The first Shine- Delgarno sequence (SD#1) is bound 
by the ribosome. which then translates the first cistron, a short cistron encoding an octapeptide. (This cistron is referred 
to as an "introductory cistron or ribosomal loader.) Thr second cistron is a globin gene, in this case, an FX alpha-globin _ _. 

55 gene. The Shine-Delgarno sequence (SD#2) for facilitating translation of the second cistron actually lies within the first 
cistron. For this reason, the two are said to be "translationally coupled". Region B is identical in structure, except that 
the second cistron encodes FX-beta globin. Between regions A and B is a 43-base intercistronic region. The introductory 
cistrons of regions A and B correspond to the first cistron of the two-cistron expression system denoted pCZ144 in 
Schoner, et al., Meth. Enzymol., 153: 401-16 (1987). The present invention is not, however, limited to the particular 
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"starter cistron taught by Schoner, et al.; other introductory cistrons that allow for restart of high level translation of a 
following cistron may be used. 

Guidance as to the design of intercistronic sequences and as to the location of SD sequences may be obtained by 
comparing the translational efficiency of spontaneous or controlled mutants of the same polycistronic operon, as exem- 
plified by Schoner, et al.. PNAS, 83: 8506-10 (1 980). It is also possible to look for consensus features in the intercistronic 
regions of different operons. McCarthy, et al., EMBO J., 4: 519-26 (1985) have identified a translation-enhancing inter- 
cistronic sequence in the E. coli atp operon. 

The present invention is intended to reduce or avoid the localization of the hemoglobin or its component polypeptides 
into inclusion bodies. Consequently, a further feature of the invention is that the functional hemoglobin is substantially 
found (preferably over 80%) in the soluble fraction of the cell. It appears that with this invention, over 90% of the functional 
hemoglobin can be so directed when alpha 2 beta 2 hemoglobin is assembled from alpha-and beta-globin chains coex- 
pressed from a tetracistronic operon as described herein. With di-alpha, beta 2 hemoglobin, nearly 1 00% is soluble when 
expression is induced at 25°C and less at higher induction temperatures. These percentages reflect the percent of all 
di-alpha and beta chains found in the soluble fraction of the cell and not actual recovery of protein from the cell. 

Expression in Yeast 

In another embodiment the present invention relates to the production of hemoglobin-like molecules in yeast. Our 
preferred host for expression of recombinant human hemoglobin in yeast is Saccharomvces cerevisiae . However, other 
fungi or yeast may be used for the purpose, such as strains of Aspergillus or Pichia . For yeast to be a suitable host it 
must be capable of being transformed with recombinant vectors, either replicating or integrating types. This allows the 
insertion of the desired DNA sequence for the gene of interest. It must also be capable of high density cell growth, in 
appropriate volume to provide sufficient cell mass to isolate the desired gene product from the desired reaction vessels, 
where ideally the growth would be easily controlled by several parameters including nutrient formulation, agitation and 
oxygen transfer and temperature. It is also desirable to be able to induce the expression of protein synthesis with the 
manipulation of the media, temperature, or by the addition or consumption of certain chemicals. Finally, to be a suitable 
host, the yeast must be capable of producing recombinant proteins, preferably in excess of 1% of the total cell protein. 
This allows more facile isolation of the desired recombinant protein. 

With reference to S. cerevisiae. haploid strains of potential use include: 


BJY3501 Mata pep4::HIS3 prbl -A 1 .6R his3 200 ura3-52 can 1 GAL, 

GSY112 As above but Leu2::HISG 

GSY1 1 2 cir As above but cured of 2ji plasmid 

BJY3505 Mata pep4::HIS3 prbl -A 1 .6R HIS3 lys2-208 trpl-1 01 ura3-52 gal2 canl 

GSY1 1 3 As above but leu2::HISG 

RSY330 Matapep4-3 prbl-1 122 hist7 ura3-52 trp1-289can1 gall 

BJY21 68 Mata prd -407 prbl -1 1 22 pep4-3 Ieu2 trp1 ura3-52 

BJY1991 Mata prb1-1 122 pep4-3 Ieu2 trp1 ura3-52 

RSY334 Mata regl-501 pep4-3 prbl -1 122 ura3-52 leu2-3, 1 12 gall 

RSY214 Mata pep4-3 prbl ura3-52 

To date, strains such as GSY1 1 2. GSY1 1 3 and RSY334 have been the best hemoglobin producers. Strains such 
as RSY334 that carry the reg1-501 mutation may be particularly important as they uncouple glucose repression from 
galactose induction, allowing one to induce with lower levels of galactose in the presence of glucose. Because this strain 
carries the gall mutation it cannot metabolize galactose, so the galactose concentration remains constant and continues 
to function as a inducer. 

Diploid strains formed from crosses of any of the above compatible strains or similar compatible strains may also 
be useful as they tend to have faster growth rates than haploid strains. 

For example, the following diploid strains, which coexpress alpha (or di-alpha) and beta globins, are described in 
the Examples: 

BJY350 [pGS4988] x RSY330 [pGS4688] 
BJY3505 [pGS4988] x BJY 1 991 [pGS4688] 

Other matings may likewise be used in practicing the present invention. 

--The use of protease-deficient strains may also be advantageous. - - 

Yeast expression systems can be divided into two main categories: (1) Systems designed to secrete protein and 
(2) system designed for the cytoplasmic expression of proteins. The advantages of secretion systems are: 

* 

(1) The protein is often easier to purify from culture medium than from total cell extracts. 
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(2) If the proteins has essential disulfide bonds, they are more likely to form if the protei n passes through the secretory 
pathway This is thought to be partly due to the presence of protein disulfide isomerase in the endoplasmic reticulum 
and to a less reducing environment than in the cytoplasm. 

5 (3) Secreton can also be advantageous if the first amino acid (after the initiating methionine) creates a dipeptide 

seq jence that is poo* iy processed by methionyl arninopeptidase. The addiction of a secretory signal sequence may 
allow pf ocei>s-ng t>y signal peptidase during secretion resulting in a protein with the authentic amino acid at the 
amino !e* m*nus of the prctem. 

10 The disad^artages of a secretion system are: 

(1) Gene* aft* tr>« expression level are much lower than those that can be obtained by cytoplasmic expression. This 
seems to&e n part, due to a rate limiting step in secretion. 

is (2) Not afl p»oteri* an? stsdetable. 

(3) often. pa*t>cUar> wrth 5 cerevisiae, misprocessed forms of the protein accumulate and these can be difficult to 
purify awa f trorr cc»»ectt> processed forms. 

The moct comnoniy uood secretory expression system used in yeast is the yeast a-factor secretory signal 
sequences and me facto* promoter. See A.J. Brake. Yeast Genetic Engineering Eds. P. Barr, A. Brake, and P. Valen- 
zuela. Butterwo*th Publishing. Boston 1 989, for a review. The invertase signal sequence coupled to a variety of promoters 
has also been used to e* press heterologous proteins in yeast See G. Stetler et al., Biotechnology, 7:55-60 (1989), R. 
Smith et al., Saence 223 1?19-1229 (1985). 

The advantages of cytopUsjTuc expression are: 

(1) Expression levels can be quite high with reports of proteins expressed at >20% of the total cell protein, usually 
as a soluble p*ore*n 

30 (2) Some proteins cannot be efficiently secreted. 

(3) Proteins that contain glycosylation sites, if secreted from yeast, will be glycosylated with the pattern of sugars 
unique to yeast Because these highly hydrophilic, post-translational modifications lie on the surface of proteins they 
are likely to confer antigenicity on the protein. If the protein is functional without the carbohydrate side chains it may 
be advantageous to produce the protein without them rather than with the yeast-specific modification (see for ex. 
Travis et al. 1985 J Biol Chem 260:4384-4389). 

(4) Some proteins have other specific modifications that may occur only in the cytoplasm, for example aminoterminal 
acetylation, modrf ication with lipids, perhaps heme incorporation and so forth. 

At present, cytoplasmic expression is preferred since the yeast cells fold together the globin chains and incorporate 
heme to produce hemoglobin in vivo. However, it is possible to separately express and secrete the alpha and beta globin 
chains and assemble hemoglobin in vjtro. 

The globin genes must be placed under the control of a suitable promoter. The commonly used yeast promoters 
generally fall into two broad categories: regulated and constitutive. Constitutive promoters that are in wide use include 
GAP, PGK (phosphoglycerate kinase) and the a-factor promoter Regulated promoters have also been used and these 
include the yeast metal lothionein promoter (regulated by copper), the Gah -10 promoter, GAL7 promoter (regulated by 
galactose and glucose) the ADHII promoter (regulated by ethanol and glucose) the PH05 promoter (phosphate regula- 
tion) and several hybrid promoters such as PH05-GAP, GAL-PGK, ADHII-GAP, and GAL-CYC 1 . 

The use of regulated promoters may be important for plasmid stability. Often expression of recombinant proteins, 
at high levels, inhibits the growth of the host organism. This disadvantage can result in the accumulation of cells in the 
population with few copies of the plasmid present, a decrease in growth rate and a drop in overall expression levels. By 
maintaining the gene in a repressed state and inducing late in the fermentation, high growth rates can be maintained 

and selection against high plasmid copy numbers can be avoided .._ _ .. . _ 

It is somewhat difficult to obtain accurate data on the relative strength of yeast promoters because the only true 
measure of this would be based on kinetics of mRNA synthesis. Most of the data that exists measures only the final 
protein concentration that has been obtained. Because there can be great differences in protein stability, it is necessary 
to compare the same protein and mRNA with multiple promoters on identical vectors. A study that approaches this was 
done by Verbakel et al. (Gene. 61:207-215 (1987)). They compared the GAPDH, CYC1 GAL7, PH05, and PGK pro- 
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moters and measured the expression of a p-galactosidase/poliovirus VP2 protein fusion. CYC1 resulted in an expression 
level of 0.14% of the total cell protein (TCP); GAPDH, 0.22% TCP; PHOS, 0.26% TCP; PGK, 0.9% TCP- and GAL7 
0.96% TCP 

Using our GALGAP hybrid promoter, we have obtained expression levels of hemoglobin that represents ==15% of 

s the total cell protein. This most probably represents an increased promoter strength along with a highly stable protein 
product and perhaps mRNA with an extended half life. 

The use off a GAL-GAP hybrid promoter is preferred. Both elements (the GAI_u AS and the GAP transcriptional 
initiation site) are well understood. Studies on the mechanisms of transcriptional regulation off the GAL regulon have 
been fairly extensive. The galactose regulon includes five genes that encode enzymes required for the utilization of 

w galactose. Four off these genes (GAL1 , GAL7, GAL10, and GAL2) are expressed only in the presence of galactose. 
Galactose induction does not occur in the presence of glucose unless the yeast strain bears a mutation in the REG1 
gene. The GAL1 , 7, 10 and 2 genes are regulated by at least two other genes, GAL80 and GAL4. The GAL4 gene is a 
transcriptional activator protein that activates mRNA synthesis from the GAL1 , 7, 1 0 and 2 upstream activator sequences 
(UASqal)- Although GAL4 is constitutively expressed, it is functionally silent in the absence of galactose. Repression 

15 of GAL4 activity, in the absence of galactose is maintained by the product of the GAL80 gene. The GAL80 protein 
apparently interacts physically with GAL4 to prevent transcriptional activation. Presumably galactose or a galactose 
derivative prevents this interaction to allow GAL4 mediated induction. 

Haploid strains of S.cerevisiae have three different genes encoding the enzyme glyceraldehyde-3-phosphate dehy- 
drogenase (GAP). These genes have been designated TDH1 P TDH2 and TDH3 and each is present as a single copy 

20 per haploid genome. The TDH3 gene produces approximately 60% of the cell's GAP enzyme and TDH1 and 2 produce 
about 12% and 28%, respectively (McAllister, Land M.J. Holland, 1985. J. Biol Chem, 260: 15019-15027). Holland's 
group (Holland et al. 1981. J. Biol Chem, 256:1385-1395; and Holland et al. 1983. J Biol Chem 258:5291-5299) has 
cloned and characterized the three GAP genes of S.cerevisiae. The clones have been designated pGAP1 1, pGAP63, 
and pGAP491 pGAP491 corresponds to the TDH3 gene and is therefore, the most highly expressed. This promoter 

25 has been used to express a wide variety of proteins in yeast including: 



Protein 

REF 

30 

a Interferon 

0) 


Hepatitis B antigen 

0) 


Thaumatin 

(2) 

35- 

Hepatitis B antigen 

(3) 

HIV III reverse transcriptase 

(4) 


human SOC 

(5) 


a1 antiprotease 

(6) 

40 
45 

(1) Bitter, G. and K.M. Eagan. 1984. Gene, 32.263-274. 

(2) Edens, L et al. 1984. Cell, 37:629-633. 

(3) Kitano, K. et al. 1987. Biotechnology 5:281-283. 

(4) Hallewell, R.A. etal 1987. Biotechnology 5:363- 
366 (5) Barr, P.J. etal. 1987. Biotechnology 5:486-489. 
(6) Travis, J. et al. 1985. J Bid Chem 260:4384-4389. 


This promoter is commonly used as a 600-850bp fragment and is essentially un-regulated. In its long form this is a 
very powerful promoter. The form we are using consists of only ^200bp 5* of the translations initiation site. This form, 
with no added enhancer sequences is substantially less active than the longer form of the promoter (Edens, L. et al. 
Cell, 2Z:629 (1984)). Our addition of the GAL enhancer region confers both regulation and high levels of expression. 
With only the GAP491 promoter, alpha and beta globin were produced at a level of less than 0.2% total cell protein; with 
the GAL-GAP491 hybrid promoter, expression jumped to 7-1 0% total cell protein. . . . _ 

Several other hybrid promoters are of particular interest: 
GAL-SIGMA 

A strong galactose-regulated promoter with the sigma transcriptional start site 
SIGMA-GAP 

A strong peptide hormone-regulated promoter with the GAP491 transcriptional start site. 
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GAL-EF III 

A strong galaciose-regulated promoter with the elongation factor III transcriptional start site. 
SIGMA-EF III 

A strong peptide hormone-regulated promoter with the elongation factor III transcriptional start site. 

5 One could easily conceive of other promoter systems that would also work. This would include, but not be limited 

to, a variety of constitutive promoters. For example, the yeast mating factora (MFa) promoter or the mating factor a 
promoter MF(a), the phosphoglycerate kinase promoter (PGK), hexokinasel, hexokinase2, glucokinase, pyruvate 
kinase, triose phosphate isomerase, phosphoglycerate isomerase. phosphoglycerate mutase, phosphofructose kinase 
or aldolase promoters may all be used. In short, any well expressed yeast promoter may work for expression of hemo- 

io globin in yeast. A wide variety of naturally occurring, regulated promoters could also be used, for example: GAL1-10, 
GAL7, PH05. ADHII have all been used to produce heterologous proteins in yeast. A variety of synthetic or semi-synthetic 
yeast promoters could also be employed such as GAL-PGK, GAL-MFa-1, GAL-MFal, GAL-SIGMA. ADHII regulatory 
sequences could also be coupled to strong transcriptional initiation sites derived from a variety of promoters. The PHOS 
regulatory sequence or the sigma element regulatory sequences could also be used to construct powerful hybrid pro- 

is moters. In addition to yeast promoters, it is conceivable that one could use a powerful prokaryotic promoter like the T7 
promoter. In this case, one could place the T7 polymerase under the control of a tightly regulated yeast promoter. Induc- 
tion of the phage polymerase in yeast cells bearing hemoglobin genes under T7 promoter regulation would allow tran- 
scription of the genes by this very efficient phage polymerase. 

Because most of the yeast regulatory sequences described above serve as targets for proteins that are positiv 

20 regulators of transcription, it is conceivable that these proteins may limit transcription in situations where the target 
sequence is present in many copies. Such a situation may obtain with vectors such as pC1 B, pCIT, pC1 U or pCl N which 
may be present in excess of 200 copies per cell. Over-expression of the positive regulator (for example GAL4) may result 
in enhanced expression. It is possible to construct a strain in which the GAL4 gene is altered to remove its promoter 
and the promoter replaced with the GAL7 or GAL1 -1 0 promoters, both of which are transcribed more efficiently than the 

25 GAL4 promoter. In this situation, the positive transcriptional activator protein GAL4 would be expressed at elevated level 
at the time hemoglobin expression was induced. 

The consensus sequence for higher eukaryotic ribosome binding sites has been defined by Kozack (Cell, 44:283- 
292 (1986)) to be: G^qCCAUGG. Deviations from this sequences, particularly at the -3 position (A or G), have a large 
effect on translation of a particular mRNA. Virtually all highly expressed mammalian genes use this sequence. Highly 

30 expressed yeast m RNAs, on the other hand, differ from this sequence and instead use the sequence AAAAAUGU (Cigan 
and Donahue, Gene, 59:1-18 (1987)). The ribosome binding site that we use for expression of the a and p-globins 
corresponds to the higher eukaryotic ribosome binding site. It is within the contemplation of this invention to systematically 
alter this RBS to test the effects of changes that make it more closely resemble the RBS of yeast. It should be pointed 
out, however, that alterations at the -2, -1 and +3 positions, in general, have been found to only slightly affect translation^ 

35 efficiency in yeast and in mammals. 

Intracellular expression of genes in S. cerevisiae is primarily affected by the strength of the promoter associated 
with the gene, the plasmid copy number (for plasmid-borne genes), the transcription terminator, the host strain, and the 
codon preference pattern of the gene. When secretion of the gene product is desired, the secretion leader sequence 
becomes significant. It should be noted that with multicopy plasmids, secretion efficiency may be reduced by strong 

40 promoter constructions. Ernst DNA 5:483-491 (1986). 

A variety of extrachromosomally replicating vectors (plasmids) are available for transforming yeast cells. The most 
useful multicopy extrachromosomal yeast vectors are shuttle vectors that use a full length 2ji -circle combined with an 
EL coli plasmid. These vectors carry genes that allows one to maintain the plasmid in appropriate yeast mutants and 
antibiotic resistance markers that allow selection in E^coN. Use of the full-length 2\x -circle, in contrast to vectors containing 

45 only a partial 2ji sequence, generally results in much higher plasmid stability, particularly in yeast strains that have been 
cured of endogenous 2jn plasmid. The pC series of vectors described herein are vectors of this type. 

Strains could also be constructed in such a way that the GALGAP hemoglobin expression cassettes were integrated 
into chromosomes by using yeast integrating vectors. Although the copy number of the hemoglobin genes would be 
lower than for plasmid vectors, they would be quite stable and perhaps not require selection to be maintained in the host 

so cell. Yeast integrating vectors include Yip5 (Struhl, etal, PNAS, 76:1035-39, 1989), Yip1 (Jd.), and pGT6 (Tchumperand 
Carbon, Gene, 1 0:1 57-1 66, 1 980). For information on these and other yeast vectors, see Pouwels, et al. , Cloning Vector. 
Vl-I, M seg. (Elsevier, 1985). 

The alpha and beta globin genes may be introduced by separate plasmids, or both upon the same plasmid. The 
advantage of a single plasmid system over a double plasmid system is theoretical. It is generally thought that there is 
55 an upper limit to the total number of plasmid copies per cell. If it is 1000, for example, the two plasmid system could 
have only 500 copies of a-chain plasmid and 500 of the p-chain plasmid. A single plasmid of 1 000 copies per cell would 
bear 1000 copies of each a- and p-chain gene. The number of copies may be irrelevant, however, if other factors are 
limiting. In fact, several groups favor using strains that contain genes integrated into various chromosomal loci. Such 
strains very stably maintain the foreign gene and do not require special media to maintain selection for the plasmid. 
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Highly expressed yeast genes show a very hi gh codon bias. The genes encoding glyceraldehyde-3-phosphate dehy- 
drogenase and ADH-t, for example, show a 90% bias for a set of 25 codons. Highly expressed yeast genes (>1% of the 
total mRNA) have yeast codon bias indices of >.90. Moderately expressed genes (0.1 -.05% of the total mRNA) have 
bias indices of 0 6-0 8. and genes expressed at low levels (>0.05% of the total cell protein) have a codon bias off 0.10- 

5 0.50 (Bennetzen and Hall. J &ol Chem., 257:3026-3031 (1982)). The calculated value for the codons of the human ct- 
globin cDNA is 0 23 A simiar value can be calculated for the p-globin cDNA. Because there is a very high correlation 
between the most commonly used codons, it is possible that hemoglobin expression from the human cDNA in yeast 
may be limned by the avaiiat>irty of the appropriate tRNA molecules. If this is so, a complete synthesis of the gene using 
the most higm> ta vexed yeast codons could improve the expression levels. It is quite possible that the greatest negative 

10 , effect of adverse codon use would be if there was an abundance of codons used in the cDNA that are represented by 
low abundance tFNAs m t>ucn a case, high level expression of hemoglobin could completely drain that pool of tRNA 
molecules, reducing translation not only of hemoglobin but of yeast proteins that happen to use that codon as well. In 
the case of the *i-giao»n human cDNA, the most commonly used leucine codon is CTG (14 of 21), this codon is never 
used in highly expressed yeast genes (Guthrie and Abelson, The Molecular Biology of the Yeast Saccharomyces. Eds. 

is Stratern, Jones and Broach. 1 002 Co4d Spring Harbor, NY). The low codon bias index and the presence of rare yeast 
codons in the glofcnn cON Ac novo boon sufficient incentive for us to synthesize a modified form of the alpha- and beta- 
globin genes using the pteterred ^east codons. 

"Pi-Alpha" and D» B eta* Hemogtob«no 

20 

The present invention Kir the* contemplates in some embodiments the combination of (a) one molecule of a di-alpha 
globin-like polypeptide with two molecules of a beta globin-like polypeptide to form a "di-alpha" hemoglobin-like protein; 
(b) two molecules of an alpha giobwviike polypeptide with one molecule of a di-beta globin-like polypeptide to form a 
"di-beta" hemoglobin -I ike protein, or (c) one molecule of a di-alpha globin-like polypeptide with one molecule of a di- 
25 beta globin-like polypeptide to form a "di-alpha/di-beta" hemoglobin-like protein. 

It should further be noted that the delta, gamma and epsilon chains have considerable homology with the beta chain 
and that the zeta chain has considerable homology with the alpha chain. Di-delta, di-gamma, di-epsilon add di-zeta 
polypeptides are the* eto e within the compass of the invention and may be used in the preparation of novel hemoglobins 
of types other than Hgb Al 

30 Preferably the di-alpha linker (rf one is used) consists of 1 to 3 amino acids which may be the same or different. A 
Mono-G!y linker is especially preferred. In designing such a linker, it is important to recognize that it is desirable to use 
one or more amino acids that will flexibly connect the two subunits, transforming them into domains of a single di-alpha 
globin polypeptide. 

The preparation of "di-beta" mutants is also contemplated. The distance between the N-terminus of one beta subunit 
35 and the C-terminus of the other is 18.4 A in the deoxy configuration and 5.2 A in the oxy form. Preferably, the di-beta 
linker consists of 4 to 9 amino acids which may be the same or different. 

The linker should be unlikely to form a secondary structure, such as an alpha helix or a beta-sheet. Certain amino 
acids have a greater tendency to participate in such structures. See Chou and Fasman, Biochemistry, 13:222-245(1974), 
incorporated by reference.. The amino acids are ranked in order of decreasing participation below. The preferred linker 
40 amino acids are boldfaced. Glycine is the most suitable amino acid for this purpose. The most preferred di-alpha linkers 
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Alpha Helix Formers 

Beta Sheet Formers 

Glu (1.53) 

Met (1 .67) 

Ala (1.45) 

Val (1 .65) 

Leu (1.34) Ha 

He (1.60) Hp 

His (1.24) 

Cys(1.30) 

Met (1 .20) 

Tyr(1.29) 

Gin (1.17) 

Phe (1.28) 

Val(1.14) 

Gin (1 .23) 

Trp(1.14) 

Leu (1.22) 

Phe (1.12) ha 

Thr (1 .20) 

Lys(1.07) 

Tro(1.19)hp 

lie (1.00) 

Ala (0.97) ip 

Asp (0.98) 

Arq (0.90) 

Thr (0.82) 

Q!y(0.81) 

Arg (0.79) 

Asp (0.80) ip 

Ser (0.79) 

Lys(0.74) 

Cvs (0.77) ia 

Ser (0.72) 

Asn (0.73) 

His (0.71) 

Tvr (0.61) ba 

Asn (0.65) 

Pro (0.59) 

Pro (0.62) bp 

Glv (0.53) Ba 

Glu (0.26) Bp 

(The letter symbols are Ha, strong aformer; ha. a former; la; weak aformer; ia, a indifferent; ba, a breaker; 
and Ba strong a breaker. The p symbols are analogous. Trp is bp if near the C-terminal of a p-sheet region.) 


The alpha helix of a polypeptide chain comprises an average of 3.6 residues per turn. In globular proteins, the 
40 average length is about 17A, corresponding to 1 1 residues or 3 helix turns. In alpha and beta globin, the helices range 
in length from 7 to 21 amino acids (A.A). The beta pleated sheet comprises 2.3 residues per turn; the average length is 
about 20A or 6 residues. 

Chou and Fasman define an alpha helix nucleus as a hexapeptide containing four helix forming residues and not 
more than one helix breaker, and a beta sheet nucleus as a pentapeptide containing three beta sheet forming residues 
45 and not more than one sheet breaker. 

The amino acid sequence in the vicinity of the di-alpha linker is as follows: 


residue # 

138 

139 

140 

141 

1 

2 

3 

4 

AA 

Ser 

Lys 

Tyr 

Arg -(XXX) n 

Val 

Leu 

Ser 

Pro 

Helix Not 

H21 

HC1 

HC2 

HC3 

NA1 

NA2 

Al 

A2 

Helix Pot 

079 

107 

061 

079 

114 

134 

079 

059 

Sheet Pot 

072 

074 

12 9 

0 90- — 

- 165 

122 

072 

062 


55 


19 

BNSDOCID: <EP 0700997A1J_> 


* 


EP 0 700 997 A1 


15 


20 


(Note: Helix- and sheet forming potentials have been 
multiplied by 100 for typographical reasons.) 


The di-alpha linker is preferably only 1 -3 amino acids. Thus, it can form an alpha helix only in conjunction with the 
linker "termini". A one or two residue linker, even if composed of amino acids with strong secondary structure propensities, 
w would be unlikely to assume an alpha helix or beta sheet configuration in view of the disruptive effect of, e.g., Arg 141 
or Ser 3. If the linker is 3 residues long, it would be preferable that no more than one residue be a strong alpha helix 
former, unless the linker also included a strong alpha helix breaker. 

The amino acid sequence in the vicinity of the di-beta linker may impose more stringent constraints. 


143 

144 

145 

146 

1 

2 

3 

4 

His 

Lys 

Tyr 

His - (XXX) 

- Val 

His 

Leu 

Thr 

H21 

HC1 

HC2 

HC3 

NA1 

NA2 

NA3 

Al 

124 

107 

061 

124 

114 

124 

134 

082 

071 

074 

129 

071 

165 

071 

122 

120 


The di-beta linker is longer (preferably 4-9 A. A.) and therefore more susceptible to secondary structure formation. 
It is desirable that the amino acid adjacent to Val-1 be an alpha helix breaker in view of alpha-helix propensities of Val- 
25 His-Leu. More generally, it is desirable that the linker not contain (or cooperate with the proximately linked amino acids 
to form) an alpha helix nucleus or beta sheet nucleus. 

Amino acids with a high propensity toward alpha helix formation may be used in the linker if accompanied by "helix 
breaking" amino acids. Similarly, Beta sheet formation may be prevented by "sheet disrupting" amino acids. 

Of course, prediction of secondary structure using Chou and Fasman's approach has its limitations and the ultimate 
30 test of the acceptability of a linker is whether or not the di-alpha or di-beta hemoglobin has the desired affinity for oxygen. 
In particular, a poly-alanine linker, despite its supposed propensity to alpha-helix formation, may well be of value since 
the alanine group is compact and therefore the linker should be quite flexible if secondary structure does not form. 

In an especially preferred embodiment, di-alpha and beta globin genes are combined into a single polycistronic 
operon. The use of a polycistronic operon is not, however, necessary to practice the present invention, and the alpha 
35 (or di-alpha) and beta (or di-beta) globin genes may be expressed from separate promoters which may be the same or 
different. 

While the preferred "genetically fused hemoglobin" of the present invention is one comprising a di-alpha and/or di- 
beta globin, other globin chains may be genetically fused and used in the production of hemoglobins of species other 
than Hgb Al (atfj). 

40 The appended claims are hereby incorporated by reference as a further enumeration of the preferred embodiments. 
All cited references are incorporated by reference to the extent necessary to enable the practice of the invention as now 
or hereafter claimed. 

By the means suggested in this specification, we have had the following achievements: 

45 (1) Expression of Met-FX-alpha globin in E. coli from a dicistronic operon comprising a introductory cistron and a 
Met-FX-alpha globin gene, both transcribed from a Tac promoter. (Example 2) 

(2) Expression of Met-FX beta globin in E. coli by similar means. (Example 2) 

so (3) Co-expression of Met-FX-alpha globin and Met-FX-beta globin in E. coli from a tetracistronic operon comprising 
an introductory cistron, an FX-alpha globin gene, an intercistronic sequence, a second introductory cistron, and an 
FX-beta globin gene, all controlled by a single Tac promoter. The alpha and beta globins were intracellular^ assem- 
bled into functional FX hemoglobin. The FX Hgb was enzymatically converted to Hgb. (Example 3) 

55 (4) Co-expression as in (3) above of mutants of FX-hemoglobin in which the beta globin subunits possessed the 
Beth Israel, Cheverly, Providence/MSR, Kansas or beta^ 7 Val -» lie mutations. (Example 4) 

(5) Co-expression as in (3) above of Met-alpha globin and Met-beta globin and intracellular assembly into Met-Hgb 
(a.k.a. Des-FX-Hgb). (Example 5) 
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(6) Co-expression as in (3) above of Des-Val-alpha globin and Des-Val-beta globin. (Example 6) 

(7) Co-expression as in (6) above, but with the Des-Val-beta globin gene preceeding the Des-Val-alpha globin gene 
within the operon. (Example 6) 

(8) Co-expression as in (3) above of (Des-Val)-alpha-(Gly-Gly)-alpha globin and beta-globin and intracellular assem- 
bly into a di-alpha Hgb. (Example 8) 

(9) Cci-expression as in (8) above of (Des-Val-alpha-(GlyGly) -alpha globin and a mutant (Nagai, Arg-Nagai, or Kan- 
10 sas) of beta globin and intracellular assembly into a mutant di-alpha Hgb. (Example 9) 

(10) Co-expresion as in (3) above of Des-Val-alpha globin and a Des-Val beta-globin with the Presbyterian mutation 
and intracellular assembly into a mutant Des-Val Hgb. (Example 11). 

is (11) Co-expression as in (8) above of a (Des-Val)-alpha-(GlyGly)-alpha globin and of a betal globin with the Pres- 

byterian mutation. (Example 11). 

(12) Expression of di-alpha globin and beta-globin from separate promoters on the came plasmid (Example 12). 

20 (13) Devised Hypothetical Protocol for Co-expression of di-alpha globin and two copies of beta globin from same 

operon (Example 13). 

(14) Devised Hypothetical Protocol for Co-produciton of Di-beta Hgb (Example 14). 

25 (15) Devised Hypothetical Protocol for co-expression of alpha globin and beta-globin under control of separate 

promoters on the same plasmid (Example 16). 

(16) Devised Hypothetical Protocol for co-expression of alpha and beta-globin under control of different promoters 
on different plasmids (Example 17). 

30 

(17) Co-expression of a and p globin in S. cerevisiae (Example 19). 

(18) Co-expression of di-a globin and p-globin in S. cerevisiae . (Example 23). 

35 (19) Construction of Des-Val-alpha globin and Des-Val-beta globin in E. coli under lambda P L control. 

(20) Co- Expression of di-alpha globin and Des-Val-beta globin in E. coli under lambda P L control. 

(21) Co-Expression of alpha and beta globin from separate plasmids in diploid strains of S. cerevisiae . 

40 

(22) Co-Expression of di-alpha globin and beta globin Presbyterian mutant in S. cerevisiae . 

(23) Preparation of vectors for expression of di-alpha globins with -Gly- or -Pro- linkers. 

45 (24) Evaluation of different strains and induction temperatures for expression of di-alpha Hgb in E, con. 

(25) Co-Expression of alpha and mutant beta globin in S. cerevisiae. assembling to form low-affinity Hgb mutants. 

An unexpected and surprising change in oxygen binding characteristics of hemoglobin was observed upon replace- 
50 ment of the N-terminal valine with methionine. As illustrated in Example 1 1 , hemoglobin purified from blood has a 
P 50 value of 4.03 with N=2.7. DesFX-hgb produced in B con, a hemoglobin identical to A<> except for the addition of a 
methionine at the N-termini of the alpha and beta chains, has essentially the same P 50 and N values. (Example 7). Thus, 
the addition of a methionine, without altering the adjacent valine residue, has little or no effect on oxygen binding. On 

the other hand , a two-fold higher P 50 value. 7.04, was observed for desVal-hgb produced in E, coli. a hemoglobin in 

55 which the normal N-terminal valine of each chain was replaced with methionine. Cooperative as measured by N. was 
the same, however, for this molecule as for the others. A similar comparison was made for two hemoglobins each con- 
taining fused alpha chains and each containing the Presbyterian mutation, one produced in E. coli (Example 11) and 
one in yeast. The E. poli hemoglobin was constructed with a Des-Val alpha chain, i.e., the N-terrninus had the normal 
valine replaced with methionine. Oxygen binding was characterized by P 50 =33 and N=2.4 (Example 11). The corre- 
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sponding yeast coding region begins with an additional methionine codon in front of the normal valine codon. Because 
this initial methionine is removed post translationally in vivo, the purified hemoglobin has a normal N-terminal valine. 
For this molecule, P 50 =23 to 25 and N=2.5. Thus, in two quite different cases, the replacement of an N-terminal valine 
with an N-terminal methionine increased the P 50 value. Under physiological conditions, it is expected that the fused 
5 Presbyterian hemoglobin produced in E^coN will deliver 20-30% more oxygen than the similar hemoglobin, with its altered 
N-terminus, produced in yeast. 

A very large number of different plasmids are referred to in the Examples which follow. In order to highlight the 
relationships among these plasmids, a Table of vectors has been compiled (See Table 100). 

io Example 1 : Construction of FX-alpha Globin (pDLH-62m) and BetaGlobin (pDLII-10a) Expression Vectors 
Materials and Methods 

Unless otherwise stated all electroelutions, phenol/chloroform extractions, ethanol (EtOH) precipitations, alkaline- 
rs SDS plasmid purifications, agarose electrophoresis and DNA manipulations were carried out essentially as described 
by Maniatis el al ("Molecular Cloning" Cold Spring Harbor, New York, 1 982). 

The following abbreviations and definitions are used: ethylenediaminetetraacetic acid (EDTA); sodium dodecylsul- 
fate (SDS); polyacrylamide gel electrophoresis (PAGE); dimethylsulfoxide (DMSO); dithiothreitol (DTT); isopropyl- beta - 
D-thiogalactopyranoside (IPTG); 2xYT medium (16g bacto tryptone, 10g Bacto yeast extract, 5g NaCI per (iter water); 
20 SDS-PAGE loading buffer (0. 1 25M Tris-HCI, pH6.8, 20% v/v glycerol, 2% SDS, 2% 2-mercaptoethanol, 0.01 % bromphe- 
nol blue); phosphate buffered TB medium (24g yeast extract, 12g tryptone, 4 ml_ glycerol, 17mL 1M KH 2 P0 4 , 72 mL 
1MK 2 HP0 4 per liter water); Tris-EDTA buffer (TE: 10mMTris, pH8.0, ImM EDTA); Tris-borate, EDTA buffer (TBE: 0.089 
M Tris, 0.089 M boric acid, 0.002 M EDTA); Tris-acetate EDTA buffer (TAE: 0.04 M Tris, 0.04 M acetic acid, 0.001 M EDTA). 
Protein electrophoresis was performed by the method of Laemmli, U.K. (Nature 1970, 227. 680-685) on 15% SDS- 
25 polyacrylamide gels. 

E.cqli JM109 cells were made competent as follows: Two hundred milliliters of 2xYT medium was inoculated with 
2mL of an E.coli JM109 overnight culture. The 200mL culture was then incubated with shaking at 37°C for 1 hour. Cell 
were harvested by centrifugation at 6000 rpm at 4°C for 4 min in a Beckman JS13.1 swinging bucket rotor (Beckman 
Instruments, Inc., Palo Alto, CA). The cells were resuspended in 50mL total of a buffer containing 45mM MnCI2, 60mM 

30 CaCI 2 , 40mM KOAc. pH 6.2, 15% sucrose (w/v) 1.3% RbCI (w/v), and 7.5% (v/v) glycerol. Following centrifugation as 
above, the cells were resuspended in 20mL of the same buffer and incubated at 0°C for 30 minutes. Cells were dispensed 
in one milliliter aliquots and stored at -80°C until used. 

Unless otherwise stated, all restriction enzyme digests were performed under conditions suggested by the manu- 
facturer. DNA concentrations wore determined by Beer's law using measured absorbance at 260 nm (A260) against a 

35 water reference standard. For synthetic oligonucleotides the following extinctions were used: E260=o.05mL/ug/cm and 
for double stranded DNA E26o=o.02 mUug/cm. 

Globin Gene Synthesis 

40 Each globin gene was constructed from 14 separate oligonucleotides ranging in length from 50-85 base pairs. The 
FX-alpha globin gene was synthesized from oligonucleotides SJH l-33a-f, SJH l-34a-f and SJH l-35a,b; FX-beta globin 
gene was synthesized from SJH l-36a-f. SJH l-37a-f and SJH l-38a,b. Each globin gene is preceeded by a short loader 
gene as previously described. Oligonucleotides were synthesized on a Biosearch 8600 instrument using beta-cya- 
noethylphosphoramidite chemistry on 1,000 angstrom CPG columns (0.2 mmole) (Beaucage. S.L. and Caruthers, M.H. 

45' Tet. Lett. 1 981 , 22, 1 859-1 862). The sequence of these oligonucleotides is given in Figure 4. 

Unless otherwise stated, the oligonucleotides were cleaved from the columns using the following protocol: Approx- 
imately 0.5mL of fresh, concentrated NH 4 OH was drawn into the column with a 1 mL syringe. The NH 4 OH was allowed 
to react for 20min then expressed into a glass vial (approximately 4mL capacity). This process was repeated 2 times, 
the vial was filled to greater than 75% capacity with NH 4 OH, and heated at 55°C, overnight. The samples were lyophilized, 

so resuspended in 0.1 mL H 2 0 and the concentration estimated by measuring A^q. 

Two hundred micrograms of the individual oligonucleotides were purified by urea polyacrylamide gel electrophoresis. 
To do this, an equal volume of 2x loading buffer (90% formamide (v/v), O.SxTBE, 0.05% (w/v) bromophenol blue, 0.05% 
(w/v) xylene cyanol) was added to the oligonucleotide. The sample was heated at 95°C for 1 0 min. and applied to a 1 0% 
acrylamide gel containing 7M urea and 1 XTBE. Electrophoresis was at 800 volts for approximately 2 hrs. The full length 

55 oligonucleotide was visualized under ultraviolet light. That region of the gel was then excised and incubated in 3mL of 
100mM Tris, pH 7.8, 500mM NaCI, 5m M EDTA buffer at 60°C, overnight. 

The oligonucleotide solution was further purified by reverse phase chromatography as follows: A C18 Sep-Pak 
cartridge (Waters Associates) was washed with 10mL 100% methanol followed by 10mL of H2O. The oligonucleotide 
solution was applied to the column, washed with 20mL H2O and eluted with 3x1 mL aliquots of 50mM triethylammonium 
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acetate, pH 7.3/methanol (1:1). The purified oligonucleotide was lyophilized, washed with 100% ethanol, dried, and 
resuspended in 0.1 mL H 2 0. The concentration was determined by A 2 60- 

The synthetics FX-beta gene sequence (included in Figure 5) was constructed as follows: 1 0Opmole of the following 
oligo nucleotides were kinased in 3 separate reactions. Reaction 1 contained oligonucleotides SJH l-36b, c, d, e, and f. 

5 Reaction 2 contained SJH l-37a, b, c, and e. Reaction 3 contained SJH l-37d, f. and SJH l-38a. After combining the 
appropriate oligonucleotides, the solutions were lyophilized to dryness and resuspended in 16uL of H2O. Two uL of 10x 
kinase buffer (0.5M Tris-HCI, pH7.4, 0. 1 M MgCy, 0.5 uL of 100mM DTT, and 1 uL of T.OmM ATP were then added. The 
reaction was initiated by addition of 1uL (2U) of T4 polynucleotide kinase (IBI, Inc., New Haven, CT). After incubation 
at 37°C for 1 hour, the reactions were heated to 95°C for 1 0 minutes to inactivate the kinase. The three reactions were 

10 combined and 100pmoles of oligonucleotides SJH l-36a and SJH l-38b were added. After addition of 10uL of 100mM 
Tris, pH 7.8, lOOmM MgCI 2 , the oligonucleotides were allowed to anneal by incubating at 65°C for 30 min, 37°C for 
30min, and 15°C for 1 hour. Annealed oligonucleotides were ligated by addition of ATP fl mM, final) and DTT (10 mM 
final) and 4uL (20U) T4 DNA ligase (IBI, Inc., New Haven, CT) and incubation at 1 5°C for 1 hour. Aliquots of this ligation 
mixture were then cloned directly into M13mp19 (see below). 

15 Oligonucleotides for the construction of FX-alpha globin were similarly purified, kinased, annealed, and ligated. 
Before ligation into M13mp1 9, the full length FX-alpha globin gene was purified by electrophoresis through 0.8% agarose 
in 1XTAE buffer and electroeluted into dialysis tubing using 0.5X TBE as the electroelution buffer by the method of 
Maniatis et aj. Eluted DNA was phenol extracted, EtOH precipitated, and resuspended in 20uL TE buffer. Aliquots were 
used for cloning into M13mp19. 

20 

Phage Vectors 

For cloning, a 2, 5 or 10 fold molar excess of the individual FX-alpha and FX-beta globin gene sequences were 
combined with 200ng of double cut. gel purified M13mp19-RF (New England BipLabs, Inc.. Beverly, MD) and ligated 

25 overnight at 15°C in 50uL ligation buffer (IBI. Inc., New Haven. CT) containing 2U of T4 ligase. FX-Alpha globin was 
cloned into the Xma l/Pstl sites of M13mp19. FX-Beta globin was cloned into the Pstl/Hindlll sites of M13mp19. 

E.coli JM109was transformed with the M13mp19 ligation mixture containing the FX-alpha or FX-beta globin gene 
sequences using the following transformation protocol: Nine microliters of DMSO was added to 0.25mL of competent 
E.coli JM109 and the cells were incubated on ice for 10min. Aliquots of the FX-alpha or FX-beta globin ligation reactions 

30 were added and incubated on ice for 40 minutes and at 42°C for 3 min. One hundred microliters of a JM109 overnight 
culture was added to each transformation mix followed by 60}iL of a solution containing 50jil_ of 2% (w/v) 5-bromo-4- 
chloro-3-indolyl galactopyranoside in dimethylformamide and 10jxL of 100mM isopropylthiogalactoside. Molten B-top 
agar (10g Bacto tryptone, 8g NaCI, 6g agar per liter), 2.5mL was added and the mixture poured onto a B-bottom agar 
plate (1 Og Bacto tryptone, 8g NaCI, 1 2g of agar per liter). Following incubation overnight at 37°C colorless plaques (i.e. , 

35 clones containing inserted DNA) were removed from the plates using sterile transfer pipettes and inoculated into 1ml_ 
of a JM109 overnight culture diluted 1 :100 in 2xYT media. 

Bacteriophage clones were grown at 37°C for 6 to 8 hrs. and centrifuged in a microcentrifuge for 5 minutes. The 
cell pellets were processed for M13mp19 RT using the alkaline-SDS method and resuspended in 20i*L of TE buffer 
containing 20ng/mL DNAasefree RNAase (Sigma. St Louis, MO.). Aliquots (1-3uL) of the RF preparations were digested 

40 with 1 0-20 units each of the appropriate restriction enzymes (see above) and analyzed on 0.8% agarose electrophoresis 
gels (see above). 

M13mp1 9 clones containing the correct size inserts were grown in larger quantities to obtain single stranded phage 
DNA for sequencing. Thirty-five milliliters of 2xYT medium was inoculated with 0.3mL of a JM 1 09 overnight culture. After 
growth for 1 hour at 37°C, the 35mL culture was inoculated with 200nL of the appropriate phage-containing supernatant. 

45 The culture was incubated for 6 hours at 37°C and the culture supernatant collected by centrifugation at 9000 rpm for 
10 min in a JS13.1 rotor. The phage ware precipitated from 31 mL of supernatant by addition of 5mL 4M NaCI and 4mL 
40% (w/w) polyethylene glycol 6000 and recentrifuged as above. The phage pellet was resuspended in 0.4mL of TE, 
extracted with phenol/chloroform/isoamyl alcohol (50:49:1 (v/v)) three times, chloroform/isoamyl alcohol (49:1 (v/v)) one 
time, and ethanol precipitated. The DNA pellet was resuspended in 20fiL TE and quantitated spectrophotometrically by 

50 A^o- One microgram of phage DNA was used per set of sequencing reactions. 

Sequencing was by the dideoxy method of Sanger (Sanger, F.S. et El Proc. Nat. Acad. ScL. USA 1977, 74, 5463- 
5467) with M13 -20 and M13 -40 universal primers (New England BioLabs. Beverly, MA) and gene specific primers (for 
FX-alpha globin the following primers were used: alpha-1 5'-CGTATGTTCCTGTCTTT-3*; alpha-2 5-ACAAACTGCGT- 
GTTGAT-3'; for FX-beta globin the following primers were used: beta-1 5'-GCTGGTTGTTTACCCGT-3'; beta-2 5'- 

55 ACCCGGAAAACTTCCGTC-3'). 
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Expression Vectors pDL ll-62m and pDL 11-1 0a 

The appropriate sequences were excised from the M 1 3mp1 9 vector and cloned into the pKK-223-3 (Pharmacia/LKB, 
Piscataway, NJ)) expression vector under control of the Tac promoter (Brosius, J. and Holy, A. Proc. Nat. Acad. ScL 

s USA 1 984, 81, 6929-6933). A DN A sequence encoding alpha globin was removed by cutting with EcoRI and Pstl. The 
globin containing fragment was gel purified and ligated into EcoRI and Pstl double cut, gel purified pKK-223-3 using 
methods described above. E.coli JM109 cells were transformed with the ligation reaction containing the desired FX- 
alpha globin sequence (pDL ll-62m) (Figure 1) and selected for by growth on 2xYT media containing ampicillin (100 
ng/mL). Individual clones were screened for the presence of the desired insert (yielding pDL ll-62m) by alkaline-SDS 

10 purification of plasmids and restriction analysis with the enzymes EcqRI and Psjl using the methods described above. 
Cloning of the FX-beta globin sequence into the pKK-223-3 expression vector, yielding pDL 11-lOa (Figure 1) was done 
analogously except that restriction analysis and cloning was done with Pstl and Hind lM. 

Example 2: Separate Expression of Synthetic FX-Alpha and FX-Beta Globin 

15 

To assess the expression of the individual FX-globin gene products, E.coli JM109 clones transformed with either 
pDL ll-62m or pDL IMOa were inoculated into 2mL of TB media containing ampicillin (lOOjig/mL). The inoculum was 
grown at 37°C for 3-4 hours, then divided into two 1mL aliquots. One of the aliquots was induced by the addition of IPTG 
(1mM, final) and grown for an additional 3-4 hours. The cells were collected by centrifugation, resuspended in O.SmL 
20 SDS-PAGE loading buffer and heated at 85°C for 1 0 minutes. The total cell protein mixture was electrophoresed on 1 5% 
SDS-polyacrylamide gels using authentic hemoglobin as a molecular weight standard. 

Example 3: Polvcistronic Coexpression of FX-al pha and FX-beta Globin Gene Products from the same Operon (PDLII- 
66a) And Conversion of FX-Hemoalobin to Hemoglobin 

25 

To achieve coexpression of FX-alpha and FX-beta globins from a single polycistronic operon, the FX-beta globin 
sequence from pDL 11-1 0a was excised with Hind lll and Pstl, gel purified, and ligated into Pst I /Hind I li cut and gel purified 
pDL ll-62m. Ligation and transformation conditions were identical to those described above. Note that each globin cistron 
was preceeded by an "introduction" cistron as previously described, so that the entire Tac promoter driven operon had 

30 four cistrons. Clones were individually examined for the presence of both FX-alpha and FX-beta globin genes by digestion 
of plasmids with EcoRI and separation of fragments by electrophoresis through 0.8% agarose. Plasmids containing both 
genes (pDL ll-66a, Figure 1) produced a fragment of approximately 1 .Okb following EcoRI digestion. 

Clones containing pDL ll-66a were grown in 2mL of TB media containing ampicillin (100fxg/mL) for 4 hours at 37°. 
The culture was divided into two 1mL aliquots, one of which was induced with 1 mM IPTG. Incubation was continued for 

35 4 hours. Total cell protein extracts for both the uninduced and induced clones were examined by SDS-PAGE electro- 
phoresis to confirm the coexpression of both FX-alpha and FX-beta globin. 

To determine if the coexpression of both gene products resulted in the formation of tetrameric FX-alpha globin 2 FX- 
beta globin 2 protein, the following experiments were performed. Two liters of TB medium containing ampicillin (100iig/ml) 
was inoculated with 20m L of an overnight culture of an FX-alpha/FX-beta expressing E± coH clone and grown to an 

40 optical density at 600nm (OD 600 ) of 2.1 at 37°C. The culture was induced with IPTG (2.5mM final concentration) and 
grown to an ODeoo of 3.5. 

The cells (40gm) were collected by centrifugation at 10,000xg and suspended in 80mL of lysis buffer (50mM Tris- 
HCI, pH 8.0, 25% sucrose, 1 nM EDTA). Ten milliliters of lysozyme solution (18mg/ml in lysis buffer) was added and the 
mixture incubated on ice for 30 min. MgCI 2 , MnCI 2 , and DNAse I (Sigma, St. Louis, MO) were added to final concentra- 
45 tions of 10mM, 1 mM and 10fig/mL, respectively. The cells were incubated at room temperature for 1 hour and an equal 
volume of a solution containing 1% one percent deoxycholic acid, 1% Nonidet P-40, 20mM Tris-HCI pH 7.5, 2 mM EDTA 
was added to the lysate. 

Particulate material was removed by centrifugation at 1 0,000 x g for 1 0 min. The supernatant (~200mL) was bubbled 
with carbon monoxide for 5 min and dialyzed overnight against 4 liters of 10mM NaP0 4 buffer, pH 6.0. The cell-free 
so extract was clarified by centrifugation at 10 t 000xg for 10 min. To the supernatant was added 20g DE-52 (Whatman, 
U.K.). The pH of the suspension was adjusted to 7.0 and the ion exchange resin was removed by centrifugation. The 
pH of the supernatant was readjusted to 6.0 and the supernant was loaded onto a CM-cellulose column (2.5 x 15cm) 
equilibrated in 10mM NaP0 4 . pH 6.0 at 4°C. The column was washed with two bed volumes of 10mM NaP0 4 , pH 6.0 

followed by a linear gradient of 10rriM NaP0 4 , pH 6.9 to 20mM NaP0 4 , pH 9.0 (400mL total volume). Fractions 36 to 

55 42 contained a red solution and were combined; an aliquot of this solution was scanned from 650nm to 400nm revealing 
a spectrum identical to that for carboxyhemoglobin (Figure 6). An aliquot of the same peak was analyzed by SDS-PAGE 
electrophoresis with hemoglobin as molecular weight standard and was found to contain two protein bands of approxi- 
mate MW 15,500 and 16,200. As expected, these bands migrated at a slightly slower rate than authentic hemoglobin 
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(alpha MW=15,100; beta MW= 15,850) presumably due to the five amino acid extension of the Factor X recognition 
sequence. On this basis the material was designated FX-hemoglobin. 

FX-Hemoglobin (2.0mg) was digested at room temperature for 2 hours in 3mL of 20mM HEPES, pH 7.4, 0.1 M 
NaCl, 10mM CaCI 2 containing 2mg of trypsin (Sigma, St. Louis, MO., 10,000 units/mg). SDS-/PAGE electrophoresis 
5 confirmed the conversion of FX-hemoglobin to material that comigrates with native hemoglobin. 

Example 3A : Distribution of FX-Globin Products from E.coli Expression 

One hundred milliliter cultures of E, co|i clones expressing Fx-alpha globin (plasmid pDL ll-62m), Fx-beta globin 
io (plasmid pDL 11-1 0a) and Fx-hemoglobin (plasmid pDL ll-66a) were started with 1 ml inocula of overnight cultures. After 
growth for 4 hours, protein expression was induced by addition of IPTG to 1 mM final concentration. Incubation was 
continued for an additional 3 hours. The cells were collected by centrifugation, weighed, and lysed as described above 
except that DNAase treatment was done for 30 minutes on ice. The samples were centrifuged at SOOOxg, 10 min and 
the supernatants (representing the soluble protein fraction) brought to 2 ml final volume with H 2 0 and frozen at -80°C. 
is The pellets (representing the insoluble protein or inclusion body fraction) were washed twice in 5 mL 0.5% Triton X-1 00, 
1 mM EDTA, resuspended in 2 ml H 2 0 and frozen at -80°C. 

Analysis of protein distribution was accomplished by SDS-PAGE and Western blotting. The primary antibody was 
rabbit anti-human hemoglobin IgG. The western blotting protocol was according to the manufacturer's recommendations 
(Proto Blot Western Blot AP system, Promega Corp., Madison, Wl). Samples of soluble and insoluble protein represent- 
ee ing 60 fig of wet cell weight were analyzed from the FX-alpha and FX-beta expressing clones. Due to the greater level 
of expression in the FX-hemoglobin clone, material representing only 1 5 \iq of wet cell weight was analyzed. 

As seen in Table 1 1 the distribution of the proteins varied. Fx-alpha globin was detectable only in the soluble fraction 
while FX-beta globin partitioned between the insoluble and soluble fractions of the cel.. FX-Hgb, which stabilizes each 
separate subunrt, was found only in the soluble fraction, and at a concentration at least 2.6 times that of the individually 
25 expressed subunits. These results indicate that FX-beta globin is totally soluble only when allowed to assemble with FX- 
alpha globin. 

Example 4: Construction And Expression of Mutant FX-Alpha/FX-Beta Tetrameric Hemoglobin Expression Vector 

30 Hemoglobin Beth Israel : pDL IMOa was digested with restriction enzymes Sacl and Spe l. gel purified, and isolated 
by electro-elution. Oligonucleotides incorporating the appropriate codon change for Hemoglobin Beth Israel (beta 1 ^ 
asn->ser) (Figure 7) were synthesized as previously described above to bridge the Sacl to Spe l restriction sites. Puri- 
fication and quantitation of the individual oligonucleotides was as previously described. The complementary oligonucle- 
otides were annealed by heating to 95°C for 10 min. followed by slow cooling to room temperature over a 2 hour period. 

35 An aliquot of the annealed mixture was then combined with the Sacl /Spe l digested, gel purified pDL 11-1 0a plasmid at 
molar ratios similar to those used for the initial FX-alpha and FX-beta cloning. T 4 DNA iigase (2U) was added and the 
mixture incubated for 1 hour at room temperature. Ecgji JM1 09 was transformed with this ligation mixture as previously 
described. Individual clones were isolated, and plasmids purified were sequenced using primer beta-1 (vida supra) . 
Plasmid sequencing was done with a Sequenase kit (United States Biochemical Corp., Cleveland, OH) following the 

40 protocol supplied by the manufacturer. The appropriate mutated beta globin sequence was then excised with Hind lll 
and Pst l. gel purified, and cloned into pDL ll-62m as described above. 

Other hemoglobin mutants : The synthetic genes encoding Hemoglobin Cheverly (beta 45 phe~>ser) Hemoglobin 
Providence/MSR (beta 82 lys-->asp) and Hemoglobin beta 67 val~>ile and Hemoglobin Kansas (beta 1 02 asn~>thr) were 
prepared similarly except with synthetic oligonucleotides spanning the Sac 1 1 -> Ball I, Sai l --> Spe l, Nco l--> Kpn l and Sacl 

45 -->Spel restriction sites respectively (Figure 7). Synthesis of the mutant oligonucleotides, restriction enzyme digestion, 
gel purification, and ligation conditions were identical to those used for Hemoglobin Beth Israel. All mutations were first 
cloned into plasmid pDL IMOa, appropriate clones were sequenced, and the mutated beta globin gene was subcloned 
into Pstl and Hind lll digested pDL ll-66a. Plasmid sequencing was accomplished as described previously. E. cold cells 
were transformed, cultured, and induced as previously described. FX-hemoglobin mutants were purified by the method 

so of Example 3. Oxygen binding of purified hemoglobin mutants is shown in Table 9. 

Example 5: Production of Synthetic Met-Alpha/Met-Beta Hemoglobin 
- Construction of Des-FX Alpha and Des-FX Beta Globin Genqs - 

55 

We established that E. coli could produce tetrameric fusion-hemoglobin. Elimination of the DNA sequences coding 
for the Factor X a substrate recognition site on the N-terminai end of each peptide should result in production of synthetic 
(FX free; "des-FX") hemoglobin containing the N-terminal methionine as its only extra residue. pDL ll-62m was digested 
with Eco RI and P§ll to excise the sequence containing the FX-alpha globin gene. The FX-alpha globin gene was then 
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gel purified. pGEM-1 (Promega Corp.) was linearized with EcoRI and Pstl, gel purified, and the FX-alpha giobin gene 
ligated into the plasmid as described previously. Clones (FX-alpha pGEM) containing the FX-alpha giobin gene in pGEM- 
1 were identified by digestion of purified plasmids with EcoRI and Psil followed by agarose gel electrophoretic analysis. 
FX-Alpha pGEM was digested with Nde l and Eagl to remove the Factor X 8 coding sequence (Figure 5). Oligonucleotides 

5 containing the DNA sequence encoding native alpha giobin were synthesized with ends compatible to Nde l and Eagl 
(Figure 8) restriction sites. After synthesis, the oligoncleotides were purified, annealed, and ligated as described above. 
The sequence of pGEM-desFX-alpha (pDL ll-83a) was confirmed by dideoxy-sequencing of the plasmid using the T 7 
promoter primer (Promega Corp., Madison, Wl). A clone containing the correct sequence was then digested with Eco RI 
and Pstl The des-FX alpha giobin gene was gel purified, cloned into EcoRl/Pstl digested, gel purified pKK-223-3 to 

w generate pDL ll-86c. E.coli strain JM109 was transformed with the ligation mixtures and expression of des-FX alpha 
giobin by individual clones wasdetermined by analysis of total cell protein extracts of induced culture inocula (see above). 
DesFX alpha giobin, in contrast to FX-alpha giobin, co-migrates with authentic alpha giobin on SDS-PAGE. 

The des-FX beta giobin sequence was prepared in an analogous fashion using the FX-beta giobin gene excised 
from pDL ll-lOa with Psil and Hind lll. This gel purified beta giobin sequence was then ligated into pGEM-1 that had been 

is digested with the same two enzymes. A pGEM-1 plasmid containing the FX-beta giobin gene was digested with Nde l 
and SapH, gel purified, and used for construction of the des-FX beta giobin gene. The oligonucleotides conferring the 
desired sequence (Figure 20) were synthesized, purified, annealed, and ligated into Nde l. Sac li cut pGEM FX-beta to 
form pGEM-des-FX beta (pDL III-6f) as described above. After confirming that the sequence was correct the des-FX 
beta gene was removed with Pstl and Hind lll and gel purified. Des-FX beta was then ligated into the des-FX alpha- 

20 containing plasmid pDL ll-86c using the Pstl /Hind lH sites. Clones containing both des-FX alpha and beta giobin genes 
(pDL 111-1 3e) (Figure 9) were confirmed by Eco RI digestion of purified plasmids (see above), and screened for expression 
by comparison of IPTG induced and non-induced cultures. Dec-FX hemoglobin co-migrates with native hemoglobin on 
SDS-PAGE. 

25 Characterization of Met-Hab (Des-FX Hab) 

Recombinant methionyl-Hgb has different reactivities than Hgb Ao in the presence of chloride and phosphate ions 
(Table 1 0) and with changes in hydrogen ion concentration (Figure 1 0). The reason for this is thought to be the additional 
amino acid, methionine, on the N-termini of both globins. The N-terrninal amino group of the alpha chain is important 

30 for the change in P 50 by phosphate ion. Displacement of the N-terminal amino group in space or changing its electronic 
state by the addition of the methionine can alter these effects. 

The increase in P 50 seen in the presence of inositol hexaphosphate ion is irrelevant for any Hgb found in solution 
because the concentration of monophosphate ion found in plasma is not enough to significantly increase the P 50 . There 
is no inositol hexaphosphate ion found in plasma. The increase in P 50 needed for Hgb in solution to effectively off-load 

35 oxygen can be best attained by incorporating mutations to make Hgb lower in affinity for oxygen. 

The magnitude of change in P 50 with respect to chloride ion is not important physiologically. The effect, which is 
interesting in terms of its biochemical mechanism, does not add a significant amount of oxygen off-loading capacity to 
Hgb in solution. Some animal Hgb have extremely small chloride effects. Again, the chloride effect is thought to be 
transmitted by the alpha giobin N-terminal amino group; with methionyl-Hgb that effect could be changed because of 

40 the stearic change or because the pKa of the methionine amino group is different than that of valine. 

The P 50 of Hgb is normally most dramatically changed by hydrogen ion concentration. The so-called "Bohr effect" 
is also thought to, in part, involve the N-terminal amino group of the alpha giobin. Numerous mutant human Hgb molecules 
have been shown to have altered Bohr effect changes without any physiological deficiency. There is an advantage in 
having a limited Bohr effect, as well as phosphate and chloride effects, in a Hgb molecule to be used in solution for a 

45 number of different medical and biochemical applications. In terms of the Bohr effect, there are several applications for 
methionyl-Hgb where it could be used in the more alkaline pH range, e.g., tissue culture, organ perfusion. Figure 10 
depicts a preliminary experiment which indicates that the P 50 is actually greater for methionyl-Hgb than for Hgb Ao at 
greater than pH 7.8. It will be noted from Figure 10 that the slope of the P 50 -to-pH plot for Met-Hgb is shallower than 
that for Hgb Ao, i.e. the Bohr effect is smaller. Subsequent experiments suggest that the difference in Bohr effect between 

so Met-Hgb and Hgb Ao is smaller than that shown in Figure 10. 

The main advantage for using a Hgb molecule with fixed changes in P 50 relative to pH, chloride and phosphate, is 
that the practitioner will know the oxygen off-loading capacity of the formulation without regard for the specific conditions 
of its use. 
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Example 6 Synthesis of Synthetic Hemoglobin (Des-Val Hqb) of Native Size 
Constru^ion cf Pes- Va'- Alpha (pDL II-91A and Des-Val-Beta (dDL HMa^ Globin Genes 

5 DNA sequences encoding the globin genes in which the N-terminal valine codon in each gene is replaced by an 

ATG (methionirie) codon we* e constructed in a manner analogous to the des-FX clones except that the oligonucleotides 
inserted (F»gure £) we<e ones encoding the amino acid sequences "met-leu..." and "met-his..." for alpha and beta globin 
genes, respectrvety Following confirmation of correct sequence for both des-val alpha (pDL 11-91 f) and des-val beta 
(pDL ll-95a) genes in pGEM-i. the des-val alpha globin gene was cloned into the EcoRI/Pstl cut pKK-223 (see above) 

io to create pOL m la The des-val beta globin gene from pDL ll-95a was then cloned into pDL lll-1a using the Pstl /Hind lll 
restriction sites 

More soecrfcaB y the oes-val alpha transfer vector was prepared from plasmid pDL ll-62m as follows. Plasmid pDL 
ll-62m was d»gested wrm EcoRI and Pstl to excise the fragment containing the FX alpha globin gene. The FX alpha 
globin gene was men ge< pui rf ied The plasmid pGEM-1 (Promega Corp.) was linearized with EcoRI and Pstl, gel purified, 

is and the FX at^wt gkjum uvh? iioated into the plasmid as described previously. It was necessary to subclone into pGEM- 
1 because redundant itrstt ictKxi sites in pKK 223-3 prohibited the removal of the FX coding sequence directly from the 
individual FX-aipha and rx beta globin genes. Clones containing the FX alpha globin gene in pGEM-1 were identified 
by digestion of jx#*f i*d piavrxis with EcoRI and Pstl followed by agarose gel electrophoretic analysis. FX alpha pGEM- 
1 was digested witn Ndel and Eagl to remove the FX alpha coding sequence and oligonucleotides containing DNA 

20 sequences coding to* native alpha globin in which the N-terminal valine is replaced by a methionine were synthesized 
with ends compar»b*e to Ndel and Eagl restriction sites. After synthesis, oligonucleotides were purified, annealed and 
ligated as descnbed above The sequence of pGEM des-val alpha (plasmid pDL li-91f) was confirmed by dideodoxy 
sequencing of the plasmid using the T 7 primer (Promega Corp., Madison, Wisconsin). A clone containing the correct 
sequence for des-va? alpha (pOL M-91f) was then digested with EcoRI and Pstl. The des-val alpha globin gene was gel 

25 purified, and ligated into EcoRI/Pstl digested, gel purified. pKK-223-3 to generate plasmid pDL IIMa. 

The des-val beta giobm transfer vector was prepared in an analogous fashion using the FX beta globin gene of 
plasmid pDL Il lOa FX-beta was excised from pDL ll-lOa with Pstl and Hindi. This gel purified beta globin sequence 
was then ligated tnto pGEM-i that had been digested with the same two enzymes. A pGEM clone containing the FX 
beta globin gene was digested with Ndel and Sacll, gel purified, and used for construction of the des-val beta globin 

30 gene. The oligonucleotides encoding the desired sequence for des-val beta were synthesized, purified, annealed, and 
ligated into Ndel. Sacll cut. pGEM FX-beta to form pGEM des-val beta (plasmid pDL ll-95a). After confirming that the 
sequence was correct for des-val beta globin, the gene was removed with Pstl and Hindlll and gel purified. 

Preparation of pD L HI- 14 c and Hl-38b (des-Val-alpha/des Val beta Polycistronic Gene Clones) 

35 , 

Plasmid PDL Ill-la containing the Des-Val alpha globin gene was digested with Pstl and Hindlll and gel purified. 
The Des-Val beta globin gene was removed from pDL ll-95a using the same method. Following ligation and transfor- 
mation, individual clones containing the Des-Val alpha/des-val beta globin coexpressing plasmid pDL ttl-1 4 c were ana- 
lyzed for Des-Val hgb production by IPTG induction and SDS-PAGE. 
40 Plasmid pDL lll-38b which contains the Des-Val beta globin gene 5* to the Des-Val alpha globin gene was then 
constructed and analyzed. 

Plasmid pDL Ill-la containing the des-val alpha globin gene was linearized with the restriction enzyme Sma l. The 
plasmid was then treated with bacterial alkaline phosphatase to remove the S'-phosphate groups, phenol extracted, 
ethanol precipitated, and resuspended in TE buffer as above. A pGEM-1 clone containing the des-val beta gene was 

45 digested with Hind lll. phenol extracted, ethanol precipitated and resuspended in a ligation mixture containing a 50:1 
molar ratio of a Hind lll- Sma l linker to plasmid. This ligation mixture was then digested with Sma l and the beta globin 
fragment now containing Sma l restriction sites on both the 5*- and 3 - ends was gel purified and added to a ligation 
reaction containing the linearized form of plasmid pDL lll-1a, above, thus obtaining plasmid pDL lll-38b. The orientation 
of the alpha and beta globin genes in pDL lll-14c and pDL lll-38b was confirmed by restriction analysis. 

so Cells of E. coli strain JM109 were transformed with pDL lll-14c or pDL lll-38b and grown in 2xYT media containing 
ampicillin. Colonies were induced with IPTG as above. Individual clones were analyzed for their ability to produce des- 
val alpha and des-val beta globin polypeptides by SDS-PAGE and Western blotting. There was no appreciable difference 
between expression of i mmu nor eactive des-val alpha or des-val beta globins from the alpha-->beta orientation (pDL III- 
14c) or the beta->alpha orientation (pDL lll-38b) . . 
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Example 7: Analysis and Comparison of the Functional Characteristics of Recombinant Des-FX and Des-Val Hemo- 
globins 

JM109 cells expressing either des-FX hemoglobin (dFX-hgb) or des-Val hemoglobin (dV-hgb) were grown to an 
OD600 of 15 in a 10 liter fermenter and then induced by the addition of 300jim IPTG. Induction period was for 6 hrs. 
Cells were harvested by centrifugation and frozen at -80°C until processed. 

For purification of hgb approximately 200g of cells were resuspended in 350ml_of 50mM sodium phosphate (NaPi) 
buffer, pH 7.0 containing 200 units of aprotinin/mL and 20^g/ml DNAase 1. Ceils were then lysed by the addition of 
1 mg/mL of lysozyme and 4 passages through a Dynomill. Cellular debris was removed by centrifugation at 10,000 rpm 
in a Beckman JA14 rotor at 4°C for 40 min. The supernatant was added to 200rnL of a hemoglobin-binding resin equil- 
ibrated with 10mM NaPi, pH 7.0. The pH was adjusted to 7.0 with 10M NaOH or concentrated phosphoric acid. The 
resin was then loaded into a 5x30 cm chromatography column and allowed to settled. The column was then washed 
with 2.5 column volumes of 10mM NaPi, pH 7.0 containing 100 units/mL aprotinin. Hgb was eluted from the column in 
20mM Tris-HCI, pH 7.5 containing 100 units/mL aprotinin. This partially purified hgb was then 0.2 micron filtered and 
loaded onto a 1.6x10 cm Mono-Q anion exchange column equilibrated with 20mM Tris-HCI, pH 8.0. Hgb was eluted 
using a linear gradient of 0 to 0.4M NaCI in 20 mM Tris-HCI, pH 8.0. The material was then loaded onto a 1 .6x10 cm 
Mono-S cation exchange column. Hemoglobin was eluted with a linear gradient of 10mM NaPi, pH 7.0 to 10mM NaPi, 
pH 8.5, 160mM NaCI. The major peak of hgb was collected, concentrated to approximately 100rng/mL. and used for 
analysis. 

Functionality of the recombinant hgb was evaluated using a\Hemox analyzer at 25°C in 50mM HEPES, pH 7.4 
containing 0.1M Cl~. The following oxygen binding data were obtained: 


SAMPLE 

Pso 

N 

Ao 

4.03 

2.7 

dFX-hgb 

3.43 

2.6 

dV-hgb 

7.04 

2.8 


Of significance in these data are the following: 

1) The addition of an extra amino acid, methionine, on the N-termini of alpha and beta globins (dFX-hgb) appears 
to reduce slightly the P50 of the molecule but has little effect upon cooperativity (N): 

2) Replacement of the N-terminal valines of alpha and beta globins with methionine (dV-hgb) increases the P 50 of 
the molecule but has little effect upon the cooperativity (N). 

Example 8: Polvcistroni c Co-expression of Des-Val-Aloha/Alpha (Pi-Alpha) and Des-Val Beta Globin 

The overall synthetic plan for the preparation of a plasmid (pDL lll-47a) which co-expresses di-alpha globin and 
beta globin is given below. The starting materials are the commercially available transfer vectors M13mp19-RF, pKK 
223-3 and pGEM-1 , and the synthetic oligonucleotides described in the examples. 

For convenience, our manipulations began with plasmids, pDL ll-62m and pDL ll-10a. Plasmid pDL Il-62m was 
obtained by cloning the "FX alpha globin" gene into pKK 223-3 downstream of the Tac promoter. Plasmid pDL IMOa 
was prepared by an analogous insertion of the "FX beta-globin" gene. 

As set forth in greater detail in Figure 14, the "FX alpha globin" operon encodes two cistrons, the first expressing 
an octapeptide "loader", and the second, alpha globin preceded by Met-lle-Glu-Gly-Arg. The latter four amino acids 
constitute a recognition site for Factor X cleavage activity. The "FX beta globin" operon is similarly constructed. 

Since the Factor X recognition site was not needed here, the genetic material was manipulated to excise the "FX" 
codons. (This could have been avoided by synthesizing the desired di-alpha globin and des-val-beta globin genesdirectly 
rather than using the FX-alpha and FX-beta genes of pDL ll-62m and pDLll-IOa.) The FX-alpha globin gene cassette 
was excised and cloned into pGEM-1 to obtain pGEM FX-alpha. Similarly, the FX-beta gene of pDL I MOa was transferred 
to pGEM-1 to obtain pGEM FX-beta. 

The recognition site (FX)-encoding sequence could now be removed from pGEM FX-alpha and pGEM FX-beta to 
obtain pDL ll-91f and pDL ll-95a, respectively. The des-val alpha globin gene of pDL ll-91f was recloned into pKK 223- 
3 to generate pDL Ill-la, the gene being operably linked to the Tac promoter of pKK-223-3. The des-val beta globin gene 
of pDL ll-95a was purified and inserted downstream of the des-val alpha globin gene of pDL Ill-la to form a single 
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transcriptional unit which would encode a polycistronic alpha globin/beta globin mRNA, see pDL lll-14c. Finally, a syn- 
thetic oligonucleotide comprising the desired di-alpha linker encoding sequence and another copy of the alpha globin 
gene was inserted into pDL lll-14c to create pDL l!l-47a, wherein a Tac promoter controls transcription of a di-alpha 
globin gene and a des-val beta globin gene. 

c 

- Preparation of pDL lll-47a (di alpha/beta globin clone) 

The Eagl and Pstl restriction fragment containing most of the alpha globin gene from the plasmid pDL 11-91 f was 
gel purified and ligated to a synthetic DNA linker containing the sequence from the BstBI site of the alpha globin gene 
io to its carboxyl terminus, the glycine linker, and the amino terminus of alpha globin to the Eagl site (Figure 12). After 
digesting this ligation mixture with Pst I, the resulting fragment was cloned into BstBI/Pstl-cut pDL IIM4C to create 
plasmid pDL lll-47a (Figure 13). 

Expression of Di-alpha/Beta Hemoglobin 

15 

Individual E.coli clones were analyzed by Western blotting for production of dimeric alpha globin protein in combi- 
nation with monomeric beta globin. Appropriate plasmid construction was confirmed by digestion with EcoRI and 0.8% 
agarose gel electrophoresis. The EcoRI fragment present in the di-alpha constructs is approximately 1450 bp. 

Expression of genetically fused hemoglobin was accomplished using the IPTG induction protocol and S-sepharose 

20 purification of recombinant hemoglobin. E.coli cells (400 ml) were grown to an OD 60 o of 3.0 and induced with 1 mM IPTG. 
The cells were allowed to continue to grow for another 4 hours and then harvested by centrifugation. The cell pellet was 
resuspended in the 10mM sodium phosphate, pH 6.0 containing 1 mM benzamidine, 1 mM EDTA and 0. 1% Triton-X100. 
The cell suspension was then sonicated, centrifuged at 15,000xg for 15 minutes, and the supernatant loaded on to an 
S-Sepharose column equilibrated with 10mM sodium phosphate pH 6.0. After the sample was loaded on the column, 

25 the column was washed with 1 0 bed volumes of 1 0mM sodium phosphate pH 6.8. The dialpha hemoglobin was eluted 
from the column with 1 0mM sodium phosphate, pH 7.4, 30mM NaCI. Confirmation of hemoglobin production was accom- 
plished by visible light spectroscopy, SDS-PAGE, and Western blot analysis of purified material. 

Example 9: Preparation of Pi-Alpha Hemoglobin Low Affinity Mutants 

30 

In order to reduce the oxygen affinity of recombinant di-alpha hemoglobin, several mutations were introduced into 
the beta globin polypeptides using synthetic oligonucleotides. The restriction sites used to incorporate these mutants 
are shown in Table 3. For insertion of the Nagai (beta Val 67->lle) and Arg-Nagai (also beta Lys 82-> Arg) mutations, 
the des-Val-beta plasmid pDL ll-95a was digested with the restriction enzymes Ncol and Kpnl and gel purified. Oligo- 
35 nucelotides spanning these two restriction sites and containing the appropriate codon changes were synthesized, puri- 
fied, annealed and ligated into the gel purified plasmid. Following confirmation of correct sequence, the mutant des-Val- 
beta globin gene was excised with Pstl, Hindlll, gel purified, and cloned into plasmid pDL lll-47a. The beta globin gene 
containing the Kansas mutation (beta Asn 102=>Thr) was similarly constructed using Sacl and Spel restriction sites. 
Mutated codons for all of these beta globin mutations are shown in lower case letters in Table 3. 

AO 

Example 10: Characterization of Pi-Alpha Hemoglobins 
Oxygen Binding 

45 Oxygen binding measurements were made at 37°C in a Hemox Analyzer (Southampton, PA). The solutions were 
50mM Bis-Tris, pH 7.4, 0.1M NaC1 and 60uM heme equivalent of di-alpha hemoglobin. The solutions were measured 
between 120 and 1 .5 torr oxygen pressure. P 5 o values are given in Table 4. 

In Vivo Half Life 

50 

Di-Alpha Hgb (wild type) containing a gly-gly linker between alpha! , and alpha 2 was prepared as described previ- 
ously. The protein was formulated in 20mM NaP0 4 , pH 7.4 at a concentration of 95 mg/ml. Di-alpha Hgb was infused 
into male Sprague-Dawley rats 388 - 426 gm) at a dose of 875 mg/kg through a central venous catheter over 20-30 sec. 
Samples of blood were drawn at 2, 30, 80, 90, 120, 150, 180, 210, and 240 min. into heparinized vials. The blood was 
55 centrifuged to remove red blood cells and the plasma hemoglobin was assayed by absorbance at 540 nm. The percent 
hemoglobin remaining versus time was determined by comparison to the 2 min time point which was assumed to be a 
homogeneously mixed sample. The same experiment was repeated with non-fused des-val Hgb at a concentration of 
100 mg/ml. The data were averaged for each sample and plotted as percent Hgb remaining against time after infusion. 
The measured half-lives were 205 min and 104 min respectively for di-alpha Hgb and des-val Hgb, respectively. 
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Example 1 1 Co-expression of Wild Type Des-Val Aloha Globin and Pi-Alpha Globin with Des-Val Beta Globin Containing 
the Presbyterian Mutation 

Plasmid pSGE0.0-E4 is shown in Figure 14 and contains the following modifications as compared toother expression 
5 vectors derived from plasrrid pKK223-3. 

1) The plasnxj now contains a functional tetracycline resistance gene. 

2) The lad gene when encodes for the lac repressor protein has been incorporated into the plasmid. The lac repres- 
10 sor protein represses the TAG promoter until induction with IPTG. The repressor gene was inserted into the plasmid 

to permit transformation ol £. £oN cell lines which do not have endogenous lac repressor genes. 

The desVai beta jiotxn gene containing the Presbyterian mutation was constructed by insertion of a complementary 
pair of synthetic oii^onucieotOes mto the Sacl to Spel restriction of sites of pSGE0.0-E4. 
15 The following oivorv»cJeotOe*> (Pres-A and Pres-B) were used to construct the Presbyterian mutation in dVal-beta 
globin. 

Pres-A 5' CCACTG CG ACAAACTG CACGTTG ACCCGG (Continued below) 

20 Pres-B 3 * TCGAGGTGACGCTGTTTGACGTGCAACTGGGCC (Continued below) 

Sacj 


Pres-A AAAACTTCCCTCTGCTGGGTaaaGTA 3 1 

Pres-B TTTTCAACCCAGACGACCCAtttCATGATC 5 ■ 

30 

Following digestion with the two restriction enzymes the plasmid was gel purified to remove the wild type encoding 
DNA fragment- The annealed oligonucleotides were then ligated into the plasmid. Following transformation of JM109 
35 cells individual colonies were selected and analyzed for production of alpha and beta globins by IPTG induction and 
SDS-PAGE. Dideoxynucleotide sequencing was used to confirm the presence of the Presbyterian mutation. 

The mutant hemoglobin was produced, purified, and analyzed as described in Example 7 above. The following 
results were obtained: 


SAMPLE 

Pso 

N 

Ao 

4.03 

2.7 

dV-hgb 

7.04 

2.8 

dV-hgbPres 

33.0 

2.5 


if should be noted that the Presbyterian mutation, which results in the change of beta asparagine 108 to lysine, 
so decreases the affinity of the molecule for oxygen by 1 0 fold but does not affect the cooperativity of the molecule. 

The Presbyterian mutation has also been co- expressed with di-alpha globin containing a single glycine linker utilizing 
plasmid pSGE1 .1 -E4 (Figure 15). The result shown below indicate that the joining of the carboxy terminus of alpha 1 to 
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the amino terminus of alpha 2 has no effect on oxygen binding and cooperativity. 


SAMPLE 

P50 

N 

dV-hgbPres 
dialpha/Pres 

33.0 
33.0 

2.5 
2.4 


10 

Example 12: Construction of a Two Promoter System for the Co-expression of Pi- Alpha Globin and Beta Globin 

In this example, a di-alpha globin gene is operably linked to one promoter and a beta globin gene to a second 
15 promoter, but both genes reside on the same vector. Compare Examples 16 and 17, infra . 

Oligonucleotides (see below) encoding the sequence of the complementary strands of the TAC promoter (syn pTAC) 
and appropriate restriction enzyme sites were synthesized, gel purified, and annealed. Notice that the sequence com- 
plementary to the Xbal restriction enzyme site was designed to eliminate this restriction site when ligated into an authentic 
Xbal site. This was done to facilitate future manipulation of syn pTAC. 

20 

Sequence of syn pTAC 


25 


30 


35 


BamHI PstI 

5 1 G ATCCTGCAGAGCTGTTGACAATTAATCATCGGCTCCTATAATGTGtGG ( Cont • d ) 

3 • GACGTCTCGACAACTGTTAATTAGTAGCCGAGCATATTACACACC ( Cont • d ) 

Xbal COMP 
AATTGTGAGCGGATAACAATTTCACAC 3 • 

TTAACACTCGCCTATTGTTAAAGTGTGGATC 5« 


Plasmid pDL lll-47a (Figure 13) was digested with the restriction enzymes BamHI and Xbal, and the plasmid, 
now containing as an insert only part of the beta globin gene from the Xbal site to the Hindlll site (Figure 13a), was gel 
purified. 

40 Syn pTAC was then ligated into the plasmid to create plasmid pDL IV-64a and JM1 09 cells were transformed. Indi- 
vidual transformants were isolated and analyzed by IPTG induction and SDS-PAGE to confirm the presence of a func- 
tional TAC promoter. 

Plasmid pDL lll-47a was digested with the restriction enzymes PstI and Hindlll and gel purified to remove the beta 
globin coding sequences. Plasmid pDL lV-64a was digested with these same enzymes and the fragment encoding syn 
45 pTAC/beta gel purified. Ligation of the syn pTAC/beta fragment into the Pstl/Hindlll digested pDL lll-47a created plasmid 
pDL IV-67a (Figure 16). Individual transformants were screened for production of di-alpha and beta by IPTG induction 
and SDS-PAGE, 

The dVal-beta globin gene under the control of syn pTAC was also ligated into another location in pDL lll-47a. Syn 
pTAC/beta was removed from pDL IV-62a by digestion with Hind III and PstI. The restriction site overhangs were filled 
so with T4 polymerase and blunt end ligated into the Pvu II site of PDL lll-47a to create plasmid pJR Vl-54a (Figure 17). 
IPTG induction and SDS-PAGE analysis were as previously described. 

Results of induction experiments as evaluated by SDS-PAGE indicated that control of expression of di-alpha and 
beta by separate promoters gave little increase in the expression of either protein. Similarly, insertion of a second beta 

globin gene under regulation of a separate promoter had little effect upon production of the proteins. 
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Example 13: Hypothetical Protocol for Insertion of a Second Transiationallv coupled Beta Globin Gene Into the Di- 
Alpha/Beta Expression Plasmid 

Plasmid pDL lll-47a (Figure 13) is digested with Hindlll and Pstl and the Hindlll/Pstl fragment gel purified. The 
5 restriction site overhangs are then filled by T4 polymerase. The same plasmid is digested with Hindlll and filled in with 
T4 polymerase. The beta globin gene is then blunt end ligated into the plasmid (Figure 18). Following transformation, 
individual clones can be analyzed by restriction digest to confirm the presence and orientation of the inserted beta globin 
gene. IPTG induction and SDS-PAGE analysis can be used to evaluate production to di-alpha and beta globins. 

10 Example 14: Hypothetical Protocol for Expression of Pi-Beta Globin 

A di-beta globin gene, analogous to the di-alpha globin gene described above can be constructed as follows. Plasmid 
pDL ll-95a is a plasmid containing the des-val beta globin gene in the pGem-1 (Promega, Madison, Wl) vector. pDL II- 
95a would be digested with the restriction enzymes Pstl and BspM II and the large linear DNA sequence isolated by gel 
15 purification. A Separate pDL ll-95a plasmid would be restriction digested with Pstl and Nhe l. and the small Pstl I to Nhe ll 
fragment gel purified. Oligonucleotides encoding the di-beta linker containing the C-terminal amino acids of beta, and 
the N-terminal amino acids of beta 2 , connected by a linker peptide of variable length (See Example 15) would be syn- 
thesized, gel purified and annealed as described previously. The generic sequence for such an oligonucleotide connect- 
ing the C-terminal his of beta, to the N-terminal val of beta 2 would be: 

20 

Nhe l 

BspM II 
His Val 
5 1 -CTAGCTCACAAATACCAC (XXX) n GTTCACCTGACT- 3 ■ 
25 3 • -GAGTGTTTATGGTG ( XXX) n CAAGTGGACTGAGGCC— 5 • 


where the M XXX" indicates the codon(s) for the amino acid(s) linking the two beta globin peptides. Since the two beta 
30 globin termini are separated by approximately 18 angstroms in the deoxy state, it is anticipated that M n* would be in the 
range of 5 to 9. In the above example the oligonucleotide sequence spans the Nhe I restriction site, through the carboxyl 
terminal histidine of beta 1t a variable length coding sequence containing the amino acid linker to create di-beta, the 
amino terminal valine of beta 2 and the BspM II restriction site in the beta 2 gene. 

To construct the plasmid encoding the di-beta polypeptide, the unphosphorylated di-beta linker DNA with Nhel and 
35 BspMII restriction sites at its termini would be ligated to the Pstl /NHe l fragment. Following ligation the dimeric forms 
resulting from the ligation of the Pstl sites would be digested with Pstl to produce fragments containing only Pstl and 
BspM II termini. These fragments would then be subcloned into the Psti /BspM II cut plasmid. Following transformation, 
clones containing the di-beta construct would be confirmed by Pstl and Hind lll restriction fragment analysis. The linker 
region would be sequenced and the appropriate constructs subcloned into the expression plasmid containing either the 
40 di-alpha globin gene or a single alpha globin gene. E. coli would be transformed in the previously described fashion and 
assayed for production of hemoglobin as described before. 

Example 15: Hypothetical Protocol for Development of Linkers bv Mutation and Selection 

45 In this hypothetical example, the linker is obtained by mutagenesis of a linker-encoding DNA sequence and selection 
for functional linkers. Oligonucleotides spanning the BstBI site of alpha! to the Eaql site alpha 2 will be synthesized such 
that the six nucleotides comprising the preferred glycine-glycine linker are randomized. By randomizing these nucle- 
otides, codonsfor all combinations of amino acids will be present in the oligonucleotide mixture. Following purification 
and annealing of the oligonucleotides they will be used to construct the di-alpha/beta co-expression genes as described 

so above. 

Clones containing the various di-alpha/beta plasmids will then be screened for production of increased levels of 
recombinant hemoglobin using a protocol developed at the Company. E. coli clones will be arrayed on nitrocellulose 
filters overlayed on 2xYT-ampicillin plates containing 1 mM IPTG. Following overnight incubation of 37°C, the plates will 
be sealed in a plastic bag in which the air has been displaced by carbon monoxide (CO). CO binding to intracellular 
55 recombinant hemoglobin produces a distinctive red color in the E.coli colonies. Colonies producing the most intense red 
color will be further analyzed. 

In this experiment the assumption is made that certain combinations of amino acids in the di-alpha linker will permit 
more stable folding of the individual, linked alpha globin chains and, therefore, result in greater levels of production of 
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intracellular recombinant hemoglobin. This increase level of production will result in a more intense red color in the 
appropriate E.coli clones. 

After selection of several clones producing higher levels of recombinant hemoglobin, more detailed analyses will 
be done on individual clones to determine the optimal di-amino acid linker. The analyses will include determination of 
5 quantities of recombinant hemoglobin produced, oxygen affinity, and protein stability. Finally, clones found to be produc- 
ing the best quality recombinant hemoglobin will be DNA sequenced to determine the amino acids comprising the linker. 

Example 16: Hypothetical protocol for the synthesis of plasmids containing alpha and beta alobin genes under the 
regulation of two separate promoters on the same plasmid. 

w 

It is anticipated that recombinant hemoglobin can be expressed from constructs where the different globin genes 
are under the control of separate promoters. This situation would yield two separate mRNA's; one with a dicistronic 
sequence encoding an alpha globin gene and another with a dicistronic sequence encoding a beta globin gene. For 
construction of an expression system in which both the alpha and beta globin genes are under the regulation of separate 

is promoters, on the same plasmid, the following protocol would initially be used. Plasmid pDL Ill-la containing the des-val 
alpha globin gene would be digested with the restriction enzyme Bam HI. reacted with bacterial alkaline phosphatase, 
phenol extracted, ethanol precipitated, and resuspended in TE buffer. The plasmid pJR llll 50-a which is the pKK expres- 
sion plasmid containing the des-val beta construct would then be digested with the restriction enzymes Bam HI and Pvu l. 
to excise a fragment from the plasmid containing the P tec promoter, the des-val beta sequence, the transcriptional ter- 

20 minator sequence and a portion of the ampicillin resistance gene. Following gel purification of this fragment, a Pvu l- 
Bam HI linker would be synthesized and ligated onto the insert. The insert would then be back-cut with Bam HI to generate 
Bam HI compatible sites on both the 5 ( - and 3*- ends of the insert. This insert would then be cloned into Bam HI linearized 
plasmid pDL Ill-la, resulting in a plasmid in which a translationally coupled des-val beta globin gene under regulation of 
one p tac promoter is positioned on the 3* side of a translationally coupled des-alpha globin gene under regulation of a 

25 separate P tac promoter. Restriction enzyme mapping would be used to confirm the orientation of the beta globin con- 
taining insert. E. coli JM-109 would be transformed with the plasmid containing the separate P tec -globin constructs and 
grown in media containing ampicillin to isolate clones containing the plasmid. The clones containing the plasmid would 
then be induced with IPTG and expression of des-val alpha globin, des-val beta globin and des-val hemoglobin would 
be assayed by SDS-PAGE analysis, Western blotting with anti-hemoglobin antibodies and isolation of des-val hemoglobin 

30 by standard chromatographic methods. 

Alternatively, coexpression of both globin genes could be achieved from DNA sequences on separate vectors under 
the control of separate promoters. 

Example 17: Hypothetical Protocol for the Construction of Vectors Containing alpha and beta alobin genes under the 
35 regulation of separate promoters and on different vectors. 

E. coli clones containing plasmid pDL Ill-la which is the pKK223-3 plasmid containing a dicistronic loader gene/des- 
val alpha construct under the regulation of the P^ promoter would be transformed with a plasmid containing the dicis- 
tronic loader gene/des-val beta construct under control of the same promoter, but with a gene conferring additional 

40 antibiotic resistance to tetracycline. This could be constructed in the following manner: Plasmid pJ R IV-50a contains the 
des-val beta globin gene under control of the P^ promoter. This plasmid would be cut with Pvu l I to generate a linear 
plasmid with blunt ends. This would be ligated with a Notl phosphorylated linker (New England BioLabs). The ligation 
mixture will be used to transform E. coli . Plasmid DNA would be prepared and plasmids containing a Notl site identified 
by digestion with Notl agarose gel electrophoresis. This plasmid will contain the P^des-va! beta globin sequence. The 

45 gene for resistance to the antibiotic kanamycin is commercially available (Pharmacia) and contains Eco RI restriction 
sites on both ends. The ends will be converted to blunt ends by treatment with T4 DNA polymerase by the method of 
Maniatis, £l aL. The resulting fragment will be ligated with a 50 fold excess of phosphorylated Noil linker (25°C, 60 min). 
The ligation reaction would be made 0.01 M in EDTA, heated to 70° for 20 min and ethanol precipitated. The precipitated 
DNA will be taken up in 100 ul of Notl buffer and treated with 100 units of Not l (37°) for 2 hr. The fragment would be 

so purified by agarose gel electrophoresis. The Notl adapted kanamycin resistance gene would then be ligated into the 
Notl linearized pJR IV-50a to yield a plasmid with the gene for des-val beta globin under control of P tec with kanamycin 
resistance. E.coli JM-109 clones containing plasmid pDL 1 11-1 a, the plasmid containing des-val alpha globin under control 
of Ptac with resistance to ampicillin, will then be transformed with kanamycin resistant plasmid containing the gene for 
des-val beta globin and clones will be selected for resistance to both ampicillin and kanamycin. Other antibiotic resistance 

55 genes could be used as well. Expression of alpha and beta globin polypeptides under the regulation of separate pro- 
moters will then be analyzed by IPTG induction, SDS page and western blotting. 

Should we encounter problems with plasmid exclusion, we could use the same strategy with the pIN plasmids that 
have been used to express polypeptides from separate plasmids in £pJi (McNally, el aL, PNAS 85, 7270, 1988). 
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One potential problem that we may face with creating plasmids in which the alpha and beta giobin genes are under 
separate but identical promoters is the possibility of homologous recombination within the identical sequences on the 
plasmids, eg. the promotor region. This could result in deletion of a segment of important DNA sequence. It is therefore 
preferable to use different, non-homologous promotors for each different giobin gene, eg. and Ptrc , or the lambda 
5 P L promotor in appropriate host (containing cl857). 

Example 18: Synthesis and Assembly of the Pi-Aloha Beta Giobin Construct in aP L Regulated Vector System 

In prior examples, the giobin genes were under Tac promoter control, and the alpha (of di-alpha) and beta giobin 
io genes were each translationally coupled to the ribosomal loader cistron taught by Schoner, et al. In this example, the 
lambda P L promoter and a different translational coupler (see below) are used. 

Translational Coupler 


SD2 MET S£i 

5'AAT AAG GAG GAA TAA CAT ATG CTG TCT CCG GCC GAT (Cont^d) 
3'TTA TTC CTC CTT ATT GTA TAC GAC AGA £££ £££ £TA (Cont'd) 

EAGI 


AAG GCC CCA AGC TTG GGG3 • 
TTC CGG GG T TCG AA C CCC5 • 

Hind III 


The pL expression system has a different translational coupler as compared to the pTAC system. Sequences 
coding for the two SD's and the translational stop were added onto the 3' end of the N protein coding sequences to act 
35 as a translational coupler. Subsequent to that the giobin coding sequences are identical to that used in the pTAC system. 
Using the pPL-lambda vector available from Pharmacia, a plasmid construct was assembled to generate a genet- 
ically crossed-linked tetrameric human hemoglobin (wild type) in E. coli strains N99Ci+ and N4830-1 (cl857). These 
bacterial strains were obtained from Pharmacia and are inducable by addition of naladixic acid (40>g/ml) or mitomycin 
C (1 0^g/ml) in the presence of the wild type Ci+ repressor or heat treatment of the strain containing the CI857 repressor 
40 gene (Mott, et al., pNAS 82, 88, 1985 and Gottesmann, M.E„ et al., J.Mol. Biol. 14Q, 57, 1980). A diagramatic repre- 
sentation of the cloning strategy is depicted in figure 19. 

Removal of the Eaat Site from the pPL-Lambda Vector 

45 Removal of the Eagl site from pPL-lambda was necessary to enable cloning of the di-alpha gene sequence, because 
both alpha structural genes contains a Eagl site located 6 bp into the coding sequence. The pPL-lambda vector was 
digested with Eagl, and the ends were filled using T4 DNA polymerase. The Bam HI linker (5-CCCGGATCCGGG- 
S'XPharmacia), was blunt-end ligated to the Eagl digested pPL-lambda plasmid by standard methods. This eliminated 
• the Eagl site in the desired construct. The resulting mixture was digested with Eagl to eliminate any plasmids still con- 
so taining the Eagl site. E. coli N99Ci+ cells were transformed with resulting plasmid, pPL-lambda-E. Clones containing 
the desired plasmid were identified by restriction digest analysis. 

Incorporation of the Synthetic Co-Translational Coupler into pPL- Lambda- E 


Prior to inserting the giobin genes into the vector it was necessary to incorporate the synthetic translational coupler 
sequence into the Hpal site of pPL-lambda-E. This was done by digestion of pPL-lambda-E with Hpal followed by blunt- 
end ligation of the co-translational coupler into the Hpal site of the vector. Ligation of the coupler to the blunt end resulted 
in destruction of the Hpal site. The ligation mixture was treated with Hpal to digest any plasmid remaining containing 
the Hpal site. E. coli N99Ci+ cells were transformed with the resulting reaction mixture. Clones were screened with 
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EcoRI and Hind til restriction digests to identify clones containing the co-translational coupler in the proper orientation. 
DNA fragments of 522 bp and 4762 bp were observed for plasmid containing the desired orientation. To confirm the 
orientation of the coupler, the resulting plasmid was sequenced using a primer (5^AATGGAAAGCAGCAAATCC-3') 
complementary to the sequence 30 base pairs upstream from the translational coupler sequence. The desired plasmid 
5 was denoted as pPL-lambda-E+TC. 

Construction of an Expression Plasmid Containing Des-Val Alpha and Beta Genes Under Control of pPL- Lambda 

The Des-Val alpha and beta globin genes were obtained from pDL 1 1 1- 14c by digestion with Eagl and Hind III, followed 
io by agarose gel purification of the desired 942 bp segment. The purified alpha and beta globin gene fragment was cloned 
into Eatl and Hind III digested pPL-lambda-E+TC. The ligation mixture was used to transform E.coli N99Ci+, and clones 
wre screened for the presence of the desired plasmid, pPL-alpha/beta with EcoRi (4758 and 1468 bp), and with Pstl 
and Hind 1 1 1 (4005, 1 775, 520 bp) confirmed presence of the desired restriction sites. Further confirmation was obtained 
through sequencing with the 20 bp primer, above, to confirm the sequence between the co-translational coupler and the 
is Des-Val alpha globin gene, and a second primer (5'ACCCGGAAAACTTCCGTC-3*) to confirm the sequence between 
Des-Val beta and the pPL vector. 

Construction of an Expression Plasmid Containing Pi-Alpha and Beta Globin Genes Under control of pPL- Lambda 

20 An RGV linker which encodes for the carboxy terminal portion of alpha globin, linked via a single glycine residue to 
the native sequence of the amino portion of a second alpha globin chain, was prepared by separately phosphorylating 
the 5' ends of 5'CGAAATAACGTGGTGTTCTGTCTGC-3' and 3TTTATGGCACCACAAGACAGACGCCGG-5' with T4 
kinase, followed by annealing. This double stranded oligonucleotide was cloned onto the Eagl end of a purified fragment 
of Des-Val alpha and beta globin prepared from pDL IIM4c digested with Eagl and Hindlll, as described above. The 

25 linear DNA sequence generated from this ligation, now containing sticky ends coding for BstBI and Hindlll restrcition 
site sequences, was purified by agarose gel electrophoresis and cloned into BstBI and Hindlll digested pPL-alpha/beta. 
The new plasmid designated pPL-dialpha/beta contained a sequence with a co-translational coupler upstream to a 
sequence containing di-alpha globin linked via a glycine residue, followed by a cotranslational coupler adjacent to a beta 
globin gene sequence, all under conrol of a sinle P L promoter. These clones were identified through screening minipreps 

30 with Eag I restriction digestions. Clones without the second alpha globin gene merely linearised upon digestion, whereas 
clones containing the second gene released 431 bp and 6222 bp DNA fragments. 

Construction of the Expression Plasmid dSGEO.I-LO Containing a ROP- Origin of Replication Mutation 

35 pPL-dialpha/beta was digested with Pvull then treated with T4 DNA polymerase to fill in the sticky ends. The line- 
arized plasmid was then blunt end ligated witha Notl linker (Promega Corp., Madison, WI)(5TTGCGGCCGCAA-3'). The 
ligation mixture was then treated with Pvull to remove any remaining plasmid containing the Pvull site. E. coli were 
transformed with pSGE0.1-L0 and positive clones were identified by the presence of the unique Notl restriction site. 

40 Expression of Hemoglobin SGE0.1 in E. Coli Under the Control of the Lambda P L Promoter 

E. coli N99Ci+ and E. coli N4830-1 were transformed with pSGEO.1 -L0 and grown on agar plates containing amp- 
icillin, as described previously. These E. coli strains contain the cl+ repressor gene and the cl857 heat sensitive repressor 
gene, respectively. 

45 Inocula of N99Ci+ were grown at 37°C in TB media to an ODeoo of ~ 1 .0 and induced with naladixic acid (40^g/ml). 
Cultures were incubated for 4-6 hrs. at 37°C before the cells were harvested. Hemoglobin production was estimated by 
SDS-PAGE analysis of total cell protein and by western blot analysis. By these techniques, SGE0.1 was estimated to 
be produced at ~0.02% of the total cell protein in this cell line. Hemoglobin (57^g) was isolated by Mono Q chromatog- 
raphy and shown to have an optical spectrum representative of that for normal hemoglobin. 

so Inocula of N4830-1 were incubated at 30°C in TB media to an OD 60 o of 1 .0 and induced by addition of sufficient 
preheated TB media (65°C) to raise the temperature of the inocula TO 42°C. The culture was then incubated at 42° for 
4-6 hrs. Total cell protein analysis with SDS-PAGE revealted that SGE0.1 was being synthesized at 0.4% of the total 
cell protein, corresponding to 0.18 mg protein per gram of wet cell paste. SGE0.1 prepared from a 2L preparation was 
purified as described elsewhere, and resulted in isolation of 7.6 mg of purified material, SGE0.1 (7.6mg) was isolated 

55 and had a P 50 of 7.08 (Hemox Analyzer, pH 7.4, 0.1 M NaCI, 37°C), and an optical spectrum representative of native 
hemoglobin. 
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Example 19: Production of Hemoglobin in Yeast 

All restriction enzymes and DNA-modifying enzymes were purchased from BRL, New England Biolabs, IBI, Phar- 
macia or Boerhringer-Mannheim. The concentrations of enzymes used were those suggested by the supplier to produce 
a complete reaction in 30 minutes. The buffers and conditions for the use of these enzymes were those provided with 
the enzymes, unless otherwise stated. Plasmid DNA was purified from E.coli DH5a as described by Birnboim and Doly 
(Nucleic Acids Research 1979, 7:1513-1520). Electrophoretic analysis of DNA was carried out in agarose gels using 
tris-acetate electrophoresis buffer (Maniatis et al. Molecular Cloning . Cold Spring Harbor, NY, 1 982). DNA was visualized 
by staining the gels with 0.5jig/ml ethidium bromide and exposing the gel to ultraviolet light. DNA fragments were purified 
from agarose gels using a kit purchased from BIO-101 . DNA fragments were purified from acrylamide gels by crushing 
the excised gel fragment, containing the DNA, of interest in 3.25M ammonium acetate and incubating overnight at 37°C. 
Gel fragments are removed by centrifugation (12,000 x g, 15 min) and the DNA precipitated with 2 volumes of 95% 
ethanol, 5% isopropanol. The precipitate is dried in vacuo and dissolved in 0.1XTE (1XTE is 10mM Tris. HCI pH7.8, 
1 mM Na 3 EDTA), Acrylamide gel electrophoresis of DNA was done as described by Maniatis, et al. ( Molecular Cloning , 
Cold Spring Harbor, NY, 1982.) in tris-acetate electrophoresis buffer. Bacteriological growth media and DNA transfor- 
mation methods are described by R.W. Davis et al. (Advanced Bacterial Genetics, Cold Spring Harbor Laboratory, New 
York, 1980, p140-141). Transformation of S. cerevisiae with linear or circular DNA was carried out as described by H. 
Ito et al. (J. Bacteriology 1 53:1 63-1 68 (1 983)). Transformants were selected on SD medium lacking uracil or tryptophane 
(SD-ura, SD-trp) depending on the selectable marker on the plasmid (F. Sherman et al. p Methods in Yeast Genetics: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1979). All other yeast media used are described by Sherman et al 
(ibid.) 

SYNTHESIS AND ASSEMBLY OF A GALACTOSE REGULATED PROMOTER 

This synthetic promoter consists of two functional parts, a regulatory sequence and sequence that allows efficient 
initiation of mRNA synthesis. One of the regulatory regions we chose includes the nucleotide sequence that confers 
positive regulation of transcription in the presence of galactose (M. Johnston and R. Davis, 1 984. Molecular and Cellular 
Biology 4:1440-1448; L Guarente et al., 1982, Proc Nat Acad Sci (USA) 7^:7410-7414.). The transcriptional initiation 
site is derived from the consensus sequence for the S.cerevisiae glyceraldehyde-3-phosphate dehydrogenase gene 
(GAP491) (L. McAlister and M.J. Holland, J. Biol Chem 260:15019-15027, 1983; J.R Holland et al., J. Biol Chem 
258:5291-5299, 1983). 

The synthetic oligonucleotides shown in figure 1 were synthesized on a Biosearch 8600 DNA synthesizer. Each 
oligonucleotide was cleaved from its support column with 28% NH 4 OH. The blocking groups were removed by incubating 
the cleaved oligonucleotide in 28% NH 4 OH at 65°C for s 16hr. All oligonucleotides were purified by preparative poly- 
acrylamide gel electrophoresis in slabs of 10% acrylamide (19:1 acrylamide:bis-acrylamide) containing 7M urea. Oli- 
gonucleotides were eluted from acrylamide slices by incubation (16hr) in 50mM ammonium acetate (pH7.4), 2.5mM 
magnesium acetate, 0.25mM EDTA and 0.25% SDS at 37°C. Acrylamide fragments were removed by centrifugation 
(14,000xg, 10min) and the oligonucleotide precipitated from the aqueous phase by the addition of NaCI to 0.25M and 
3 volumes of 100% ethanol. The precipitated oligonucleotides were collected by centrifugation (14,000xg for 30min), 
washed twice with 80% ethanol, once with 100% ethanol and the pellets dried. Pellets were dissolved in 0.1 XTE. 2 x 
1 0~ 1 o moles (each) of oligonucleotides 1 -5 (Table 5) were phosphorylated in 0.02 ml of 0.066M Tris HCI (pH 7.6), 0.01 M 
MgCI 2 , 0.002M dithiothreitol (DTT), 0.001 M spermidine and 10 units of T4 polynucleotide kinase. Phosphorylation reac- 
tions were carried out at 37°C for 30min and terminated by heating to 96°C for 5 min. 2 x 1 0" 1 0 moles of oligonucleotides 
1 and 6 (Table 5) were added to the phosphorylated oligonucleotides in a final volume of 0.04ml of 0.07M Tris HCI (pH7.6) 
0.01 M MgCI 2 . The mixture of oligonucleotides 1 -6 (Table 5) was heated to 96°C for 5 min, 75°C for 20 min, 55°C for 30 
min, 37°C for 60 min and 25°C for 1 5 min. T4 DNA ligase (1 0 units), DTT (0.002M final concentration), and ATP (0.001 M) 
were added and the mixture incubated at 4°C for 1 6hr. The resulting 210 bp oligonucleotide contains a 5' end compatible 
with a Sail restriction endonuclease site and a 3' end compatible with an Xbal site. Because the oligonucleotides com- 
prising the two ends of the intact oligonucleotide were not phosphorylated, they cannot ligate to each other. This oligo- 
nucleotide was cloned into the vector pSK+ (Stratagene, Inc.) (Figure 21 (a)) that had been digested with Xbal and Sail. 
The ligation mixture contained 50ng of Xbal, Sail digested pSK(+), and SpMoles of the ligated oligonucleotide in a volume 
of 0.01ml. E.coli DH5a was transformed with a portion of the ligation reaction and clones that contain inserts were 
identified by screening for white colonies on LB-ampicillin (0.15mg/ml) agar plates supplemented with XGAL (4jig/ml). 
Positive identification was made by preparing plasmid DNA from these isolates and digesting with Xbal and Sail. The 
restriction digests were analyzed by agarose gel electrophoresis and three clones containing a fragment of the expected 
size (~ 250bp) were identified. The DNA sequence of all three clones was determined and one was chosen for further 
use and designated pGS2488 (Figure 21 (a)). 
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ASSEMBLY OF THE SYNTHETIC GALACTOSE UPSTREAM ACTIVATOR (GAL U ft c) SEQUENCE 


The oligonucleotides shown in Table 6 were synthesized and purified as described above. Oligonucleotides 2-5 
were phosphorylated, annealed with oligonucleotides 1 and 6 and ligated as described for the assembly of GAR The 
full length oligonucleotide generated by this protocol has non-phosphorylated ends compatible with the restriction endo- 
nuclease sites generated by Sphl and Sail. However, when the oligonucleotide is ligated to a Sail site the resulting 
junction formed between the two fragments will no longer contain a cleavable Sail site. 

The GAL UAS is contained on an SPHI - Sail fragment. To clone this fragment into pGS2488 required that we change 
the Kpnl site of this plasmid to an Sphl site. The plasmid pGS2488 was modified by cleaving with Kpnl. The Kpnl digested 
plasmid was incubated with 2 units of T4 polymerase in 0.05ml buffer containing SOjuM of each deoxyribonucleotide 
triphosphate (A,G,C,T), 0.033M Tris-acetate (pH 7.9), 0.066M potassium acetate, 0.01 M magnesium acetate, 0.5mM 
DTT and lOO^ig/ml bovine serum albumin (BSA). Na 3 EDTA was added to 0.01 5M and the mixture extracted IX with 
phenol-chloroform. DNA was precipitated with ethanol. The dry pellet was dissolved in 0.008ml of T4 DNA ligase buffer, 
50ng phosphorylated Sphl linkers (New England Biolabs) and 10 units of T4 DNA ligase. The mixture was incubated 
for 1 hour at 25°C and used to transform E.coli DH5ot. Plasmid DNA was prepared from 1 2 transformants and tested by 
restriction enzyme digestion and agarose gel electrophoresis, for the presence of an Sphl site and the absence of a 
Kpnl site. A clone containing a plasmid with these characteristics was identified and the plasmid was designated 
pGS2888 (Figure 21(b)). 

The next step in the assembly of this hybrid promoter was to clone the Sphl - Sail fragment containing the GAL UAS 
into pGS2888. pGS2888 was digested with Sphl and Sail, phenol-chloroform extracted and ethanol precipitated. Fifty 
nanograms of Sphl, Sail digested pGS2888 was incubated with 25ng of the annealed, ligated GALy AS mixture in 0.005ml 
1 X ligase buffer containing 1 0 units of T4 DNA ligase. The ligation mixture was incubated overnight at 4°C and a portion 
used to transform E.coli DHSct. Ampicillin resistant clones were isolated and plasmid DNA prepared. The plasmid DNA 
(digested with Xbal and Spl) was analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the 
expected size (~ 500bp) was identified. The sequence of the putative GAL UAS portion of this plasmid was determined 
and the plasmid was designated pGS4788 (Figure 21 (b)). The complete sequence of the synthetic GALGAP promotor 
(pGGAP) is shown in figure 20. 

CONSTRUCTION OF A pGGAP-6-GLOBIN EXPRESSION CASSETTE 

The plasmid pLCIIFX-p-giobin (K. Nagai, M. Perutz and C. Payart, Proc Nat Acad Sci (USA) 82:7252-7255) was 
used as the source of human p-globin cDNA. The coding region of the p-globin cDNA can be excised as an ApaL1 to 
Hind3 fragment that is missing only the first four nucleotides of the p-globin coding (translated) sequence. pLcFX p- 
globin (5\iq) was digested with ApaL1 and Hind3 and a 550 bp fragment containing the cDNA was purified by acrylamide 
gel electrophoresis. The ApaL1 - Hind3 fragment containing the p-globin cDNA was cloned into Xba1 , Hind3 digested 
pUC1 9 using the following adaptor (synthesized and purified as described above): 

Xbal Ncol ApaLl 
YHla/bS • -CTAGAACCATGG 

TTGGTACCACGT-5 » 


The two oligonucleotides were mixed (12.6jig of each), NaCI was added to 0.25M and three volumes of ethanol 
(anhydrous) was added. The precipitated oligonucleotides were collected by centrrfugation (14,000 x g, 15 min) and the 
pellet washed twice with 80% ethanol, once with anhydrous ethanol and dried in vacuo. The adaptors were dissolved 
in 1ml of 0.1 XTE. Fifty nanograms of Xba1-Hind3 fragments containing the p-globin cDNA and 12.5ng of the ethanol 
precipitated (non-phosphorylated) YH1a,b oligonucleotide in 0.010 ml of T4 DNA ligase buffer containing 10 units of T4 
DNA ligase. The ligation mixture was incubated at 4°C for 16 hr. and a portion used to transform E.coli DH5a. Trans- 
formants were selected on LB-ampiciilin plates containing 4^g/ml XGAL. Plasmid DNA was prepared from 12 white 
colonies. The DNA was digested with Nco1 or ApaLl and analyzed by agarose ge! electrophoresis. Four of these colonies 
contained plasmids with the expected restriction fragments and one was designated pUCl9p-globin (Figure 21 (a)). The 
plasmid pSUC2-6L (G. Stetler et al. Biotechnology 7:55-60 (1989)) (Figure 21(a)) was digested with Hind3 and Xbal 
and the large fragment was purified by agarose gel electrophoresis. The Xbal to Hind3 fragment containing p-globin 
cDNA was also purified (agarose gel electrophoresis) from pUC19p-globin. Ten nanograms of gel-purified Xbal , Hind3 
digested pSUC2-6L was mixed with 16 nanograms of the Xbal , Hind3 fragment from pUC19p-globin in 0.01ml of ligase 
buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated at 25°C for 1 hr. and a portion used to 
transform E.coli DH5a. Transformants were selected on LB-ampicillin medium and three were used to prepare plasmid 
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DNA. These were analyzed by digestion with EcoR1 and analyzed by agarose gel electrophoresis. Two of these con- 
tained plasmids with the expected restriction fragments and one was designated pGS1 1 88 (Figure 21 (b)). This plasmid 
contains p-globin under the transcriptional control of the sigma promoter and contains the 3' transcriptional termination 
signals and polyadenylation signals of the MFa1 gene. 

5 

REPLACEMENT OF THE SIGMA PROMOTER WITH pGGAP 

The plasmid pGSl 188 was digested with Sphl and Ncol and the vector plus 0-globin cDNA was separated from the 
sigma promoter by agarose gel electrophoresis and purified as described previously. Plasmid pGS4788 (1 Oug) was also 

10 digested with Sphl and Ncol and the *" 5O0bp fragment (containing pGGAP) produced by this digest was purified by 
agarose gel electrophoresis Fifty nanograms of the gel-purified p-globin containing vector was incubated with 50ng of 
Ncol-Sphl fragment containing the pGGAP promotor in 0.01ml of 1X ligase buffer with 10 units of T4 DNA ligase. The 
mixture was incubated for 1 hour at 25°C and a portion of the ligation mixture used to transform E.coli DH5ct. Ampicillin 
resistant clones were selected. Plasmid DNA isolated from 1 2 of these clones was analyzed by digestion with Sphl and 

is Ncol to identify plasmids containing the ~ 500 bp GGAP promoter. The presence of the p-globin cDNA was confirmed 
by digestion with Xbal and Sail, followed by agarose gel electrophoresis analysis. A plasmid containing the expected 
fragments was identified and designated pGS3588 (Figure 21(b)). To aid the subdoning of this fragment into a yeast 
vector, the Smal site of pGS3588 was converted to a Xhol site as follows: 1 fig of pGS3588 was digested with Smal, 
the digest was extracted once with phenol-chloroform and ethanol precipitated. The precipitated DNA was dissolved in 

20 T4 DNA ligase buffer, with 1 0Ong of phosphorylated Xhol linker and 1 0 units of T4 DNA ligase (final volume of 0.01 ml). 
The ligation was incubated at room temperature for 2 hr and a portion of the ligation mixture was used to transform E.coli 
DH5a (excess linkers were not removed prior to transformation). Ampicillin resistant clones were isolated and plasmid 
DNA prepared. Plasmids containing the additional Xhol site were identified by digestion with Xhol and agarose gel 
electrophoresis analysis. A plasmid containing the pGGAP-p-globin expression cassette was identified and has been 

25 designated pGS3888 (Figure 21 (b)). 

CONSTRUCTION OF AN a-GLOBIN EXPRESSION CASSETTE 

We obtained a partial length cDNA (pct-MRC) clone from K. Nagai (MRC, Cambridge). To adapt the cDNA encoding 

30 a-globin for expression from the pGGAP promotor, two oligonucleotide primers were synthesized (synthesis and purifi- 
cation of oligonucleotides was as described above). 
a-1 : 5 , -GAATTCCATGGTGCTGTCTCCTGCCGACAAGACC-3\ 
a-2 : S'-CTGCAGTCG ACTTAACGGTATTTGG AGGTCAGC ACGGTGCT-3 \ 

These two oligonucleotides were used as primers for a polymerase chain reaction (PCR) (R.K. Sakai et al., 1985. 

35 Science 23Q: 1350-1 354) using a Perkin Elmer-Cetus PCR kit and pa-MRC as template, a-globin cDNA (8.3ug/mi) in 
0.018 ml H 2 0 was denatured by the addition of 0.005ml 10M NaOH. The mixture was incubated at 25°C for 5 min. 
Denatured DNA was precipitated by the addition of 0.003ml 3M sodium acetate (pH 5.2) and 0.075ml anhydrous ethanol. 
The precipitated DNA was washed twice with 80% ethanol. once with 100% ethanol and dried. The dried pellet was 
dissolved in: 0.005ml 20jaM a-1 , 0.005ml 20|xM a-2, 0.01 0ml 10X Taq polymerase buffer (Perkin Elmer-Cetus), 0.0005ml 

40 Taql polymerase (Perkin Elmer-Cetus), 0.663ml HzO, 0.016ml of all four deoxyribonucleotidetriphosphates (1.25mM 
each). Taql polymerase was added after heating the solution to 94°Cfor 1 min. After addition of the enzyme the aqueous 
solution was overlayed with 0.100ml paraffin oil. The reaction-mixture was cycled, by hand, 25 times at the following 
temperatures: 37°C for 2 min, 68.5°C for 3min and 94°C for 1 min. After the twenty-fifth cycle, the reaction mix was 
incubated at 37°C for 2 min and 68.5°C for 1 0 min. A portion (0.005ml) of the reaction mixture was analyzed by agarose 

45 gel electrophoresis and revealed a band of the expected size (~ 430bp). The PCR-amplified DNA fragment should 
contain a 5' extension, that includes an ATG codon embedded in an optimal sequence context for initiation of translation 
(M. Kozack 1 986, Cell 44:283-292). The 3* end should contain an extension that includes a translational terminator and 
a Sail restriction endonuciease site. The PCR-amplification reaction was phenol-chloroform extracted and ethanol pre- 
■ cipitated. The dry pellet was dissolved in 0.05ml ImM Tris-HCI (pH 7.8). A portion of this material (0.025ml, 1 .25ug) was 

so digested with Ncol and Sail and purified by acrylamide gel (5%) electrophoresis. A gel slice containing the fragment 
was eluted by crushing the gel slice in 0.3ml 2.5 ammonium acetate (pH 7.4) and incubating at 37°C for 1 6hr. Acrylamide 
fragments were removed by centrif ugation and the DNA precipitated by the addition of 0.75ml of ethanol. The pellet was 
collected and dissolved in 0.020ml 1 mM Tris HCI (pH 7.8) 0.1 mM EDTA. This fragment was cloned into Ncol, SI digested 
and agarose gel purified pGS3888. Fifty nanograms of Ncol Sail digested pGS3888 was incubated (2hr, 25°C) with 

55 50ng of gel purified. PCR-amplified Ncol-Sall fragment containing the a-globin cDNA in 0.01ml ligase buffer with 10 
units of T4 DNA ligase. A portion of this reaction mixture was used to transform E.coli DH5a- and ampicillin resistant 
clones were selected on LB-ampiciflin medium. Plasmid DNA was prepared from 12 independent isolates and digested 
with Ncol and Sail. The restriction digests were analyzed by acrylamide gel electrophoresis (5%). all twelve contained 
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a fragment of the expected size. One of these was designated pGS4088 (Figure 22). The a-globin insert in pGS4088 
was completely sequenced to assure that no mutations had been introduced by PCR-amplif ication. 

CONSTRUCTION OF A YEAST EXPRESSION PLASMID THAT CO-EXPRESSES a- AND ft-GLQBIN GENES FROM 
5 A SINGLE PLASMID 

The a-globin and p-globin expression cassettes from the plasmids pGS3888 and pGS4088 were cloned into a single 
plasmid in a way that allows them to be excised on a single Notl fragment. This Notl fragment was then cloned into the 
high copy yeast plasmid pC1N to generate a plasmid carrying arid expressing both a-and p-globin chains under the 
io control of separate (though identical) promoters. The details are presented below. 

INTRODUCTION OF NOT! SITE INTO pSK(+) 

The plasmid pSK(+) (Stratagene, Inc.) (Figure 23(a)) was modified by digesting 100ng of purified plasmid DNA with 
15 Kpnl. After digestion was complete, the DNA was ethanol precipitated and the dry pellet dissolved in 0.05ml T4 DNA 
polymerase buffer containing 10 units of T4 DNA polymerase. The reaction mixture was incubated at 37°C for 20 min, 
Na 3 EDTA (1 OmM) was added and the sample heated to 70°C for 1 0 min. The digested DNA was precipitated with ethanol 
and the dry pellet dissolved in 0.01 ml of 1 0mM Tris HCI (7.8) 1 mM EDTA. A portion of this material (20ug) was dissolved 
in 0.005ml of ligase buffer containing 10 units of T4 DNA ligase and 50ng of phosphoryiated Notl linkers. The ligation 
20 mixture was incubated at 25°C for 2 hr and a portion used to transform E.coli DH5a. Ampicillin resistant colonies were 
selected on LB-ampicillin medium. Plasmid DNA was isolated, digested with Notl and analyzed by acrylamide gel (5%) 
electrophoresis. A plasmid containing the additional Notl site is expected to generate a new, ~ 90bp fragment. Such a 
plasmid was identified and designated pSN(+) (Figure 23(a)). 

25 CLONING THE pGGAP-ot-GLOBIN EXPRESSION CASSETTE INTOpSNO) 

pSN(+) was digested with Sail, phenol -chloroform extracted and ethanol precipitated. The precipitated DNA was 
dissolved (50ug/ml) in 1 mM Tris-HCI (pH 7.8), 0. 1 mM EDTA. pGS4088 wasdigested with Xhol and the ~ 1 1 0Obp fragment 
containing the a-globin expression cassette was isolated from an agarose gel. The purified fragment was dissolved 

30 (20ug/ml) in 0.1 XTE. Twenty-five nanograms of Sail digested pSN(+) was mixed with 50ng of the gel-purified a-globin 
fragment in 0.01 ml ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated for 1 .5 hr at 
25°C and a portion used to transform E.coli DH5a. Ampicillin resistant clones were isolated. One hundred transformants 
were transferred, in 'a grid pattern, to fresh plates. Replicas of the grid were made on a nitrocellulose filter and prepared 
for colony hybridization (R.W. Davis et al. Adv Bacterial Genetics . Cold Springs Harbor, NY, 1980.). The hybridization 

35 probe was oligonucleotide 4 (Table 6) . This oligonucleotide (20pM) was labelled with 3 2 P-ATP in a 0.02ml reaction mixture 
containing 0.050M Tris-HCI (pH 7.6), 0.01M MgCI 2 , 0.005M DTT, 0.0001M spermidine, 0.0001M EDTA, 50pM of 32P- 
ATP and 10 units of T4 polynucleotide kinase. The reaction mixture was incubated for 2hr at 37°C, unincorporated ATP 
was removed by spin-column chromatography (BIORAD) used according to the manufactures instructions. The filters 
were incubated (37°C) in hybridization solution (6XSSC, 50% formamide, 2% SDS f 20pMoles of ^P labelled probe) for 

40 16 hr. The filters were washed 4 times (15 min each) with IX SSC, 0.1% SDS at 55°C; twice (5 min each) with 0.2XSSC, 
0.1% SDS at 50°C and twice (5 min each) with 0.2XSSC. Autoradiographs were made of the filters. Colonies that pro- 
duced a hybridization signal were used to prepare plasmid DNA. This DNA was digested with EcoR1 and the fragments 
produced by this digest were analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the correct 
size was identified and designated pGS4888 (Figure 23(a)). 

45 

CLONING THE pGGAP-p-GLOBIN EXPRESSION CASSETTE INTO pGS4888 

The construction of pGS4888 required cloning an Xhol fragment (pGGAP -a-globin) into a Sail site. The combination 
of a Xhol site with a Sail site destroys the recognition sites for both Xhol and Sail restriction endonudeases, leaving 

so pGS4888 with a single unique Xhol site. An Xhol fragment from pGS3888 was purified by agarose gel electrophoresis 
and cloned into Xhol digested pGS4888 essentially as described for the construction of pGS4888. Colony filter hybrid- 
izations were performed using the Ncol-Sall fragment (containing p-globin cDNA) from pGS3888 as a hybridization 
probe. This fragment was purified by agarose gel electrophoresis and radioactively labelled with tx32p<iCTP using an 
Amersham random oligonucleotide labelling kit. Filters were hybridized, washed and autoradiographs made as described 

55 for the construction of pGS4888. Plasmid DNA was prepared from colonies producing a hybridization signal and digested 
with either Ncol or Ncol and Sphl. These digests allow identification of plasmids that contain both the a- and p-globin 
expression cassettes and reveals the relative orientation of the a- and p-globin transcriptional units. A plasmid containing 
both transcriptional units was identified and given the designation pGS189 (Figure 23(b)). 
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CONSTRUCTION OF PLASMID pC1N 

The plasmid pClU was modified to introduce a Notl site. pClU DNA (lOOug) was digested with Sail, phenol-chlo- 
roform extracted and ethanol precipitated. The Sail ends were modified with T4 DNA polymerase to produce "blunt" 

s ends and phosphorylated Notl linkers were added. The procedures used for these modifications are essentially the same 
as those used to produce pSN(+) (described above). The plasmid resulting from these manipulations is called pCIN 
(Figure 23(b)). This pC1 N was digested with Notl , phenol-chloroform extracted and ethanol precipitated. A ~ 2.4kb Notl 
fragment (carrying a- and p-globin expression cassettes) was purified from Notldigested pGS189 DNA by agarose gel 
electrophoresis. Fifty nanograms of Notl digested pCl N was mixed with 50ng of the gel purified, 2.4 kb fragment isolated 

w from pGS189 in 0.01 ml of ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at^ 
25°C for 2hr and a portion used to transform E.coli DH5a. Ampicillin resistant clones were selected on LB-ampicillin 
plates and plasmid DNA prepared. The purified DNA was digested with EcoRI and analyzed by agarose gel electro- 
phoresis to identify plasmids carrying the a- and p-globin genes and to determine the orientation of the insert with respect 
to the vector. Two plasmids were identified, representing the two possible orientations and have been designated pGS289 

is (Figure 23(b)) and pGS389 (Figure 23(b)). 

EXPRESSION OF RECOMBINANT HUMAN HEMOGLOBIN IN SACCHAROMYCES CEREVISIAE 

S. cerevisiae strains GSY112 (MATotpep4::H1S3prb1 1.6R his3 200 ura3-52 leu2::hisG can1 cir°) and RSY334 
20 (MATa- reg! -501 pep4-3 prbl -1 1 22 ura3-52 leu2-3, Ieu2-1 1 2) were transformed with plasmids pGS289 or pGS389 by 
the method of ito et al. (J. Bacteriology 153:163-168 (1983)). Transformants were selected on yeast SD-ura. Single 
colony isolates were picked and streaked to SD medium lacking uracil and leucine. Colonies from this selective medium 
(SD-ura,-leu) were used to inoculate 2ml of SD-ura.-leu, the cultures were incubated at 30°C for 24hr and used to 
inoculate 20ml cultures of YP + 3% ethanol, these were incubated for an additional 24hr and galactose was added to a 
25 concentration of 2%. Samples were removed at 4, 8, 24, 28, 32 and 48 hours post induction. Cells were collected by 
centrifugation and washed with 1 ml 10mM Tris HCI (pH 7.8). 1 mM EDTA and resuspended in SDS-PAGE sample buffer 
(2 x 1 08 cells/ml). Samples were boiled for 10 min and the insoluble material removed by centrifugation, 0.005ml samples 
were analyzed by SDS-PAGE (12.5% gel) followed by transfer to nitrocellulose, a- and p-globin chains were stained 
using commercially available rabbit anti-human hemoglobin and an immunoblotting kit purchased from Promega. The 
30 protocols used were supplied by Promega. The immunoblot indicated that both chains are synthesized and that pGS289 
produces slightly less material than pGS389. An apparent lack of stoichiometry between the a-chain (lower band) and 
p-chain is due to a difference in immunoreactivity of the antibody to the two chains. This was demonstrated by comparing 
Commassie brilliant blue stained gels of purified human and recombinant hemoglobin with immunoblotted samples. 

35 Example 20: Characterization Of Human Recombinant Hemoglobin Synthesized in Yeast 

RSY334[pGS389] was grown in 100ml SD-leucine to an OD 600 of 2.4. This was used to inoculate 1L of YPE. This 
culture was shaken at 30°C, for 24hr at which time 50ml of 40% galactose was added. The incubation was continued 
and the cells harvested 24 hr later (OD 6 oo of 9). The cell pellet was resuspended in 100ml of 0.01 M Tris-HCI (pH 7.18), 

40 0.001 M EDTA and carbon monoxide bubbled through the cell suspension for 3 min. The cells were collected by centrif- 
ugation and after the CO treatment were distinctly red. The cell pellet (19 gm) was resuspended in 19ml of lysis buffer 
(0.01 M NaP0 4 pH 6.0, 0.020M DTT, 1% Triton X 100. 0.001 M PMSF, 0.005M benzamidine and O.OSmM leupeptin) and 
bubbled with carbon monoxide for 2 min. The cell suspension was sonicated with Branson 250 sonicator equipped with 
a 0.5 inch disrupter horn. The sonication time was 2 min (0.5 sec pulses) at full power. This was repeated 4 times with 

45 2 min cool-down periods between sonication intervals. The cell debris was removed by centrifugation (27,000 x g for 1 5 
min) and the supernatant saved (0°C). The pellet was resuspended in 5ml of lysis buffer. The resuspended pellet was 
sonicated with the microtip as described above (70% of maximum output). The cell debris was removed by centrifugation 
as described above and this supernatant combined with the first. The combined solutions were clarified by centrifugation 
r (38,000 x g, 20 min), producing a clear, red solution. This was loaded (after adjusting thepH to 6.0, with 10mM phosphoric 

so acid) onto a 5ml S-Sepharose fast flow column equilibrated with 0.01 M sodium phosphate (pH 6.0). All of the red material 
bound to the column and the column was washed with 20ml of 0.01 M sodium phosphate (pH 6.0). The column was 
eluted with 0.05M sodium phosphate (pH 7.5), 0.1 M NaCI and 1 ml fractions were collected. The red color eluted in two 
fractions. The purity of this material was analyzed by SDS-PAGE and appears to be s 50% pure after the first chroma- 
tography step. This material was dialyzed against 1 0mM sodium phosphate (pH 6.0), 0.001 M EDTA for 1 6 hr at 0°C and 

55 re-chromatographed on a mono-S FPLC column (ph 6.8 - pH 9.0. 0.02M sodium phosphate gradient). The peak fraction 
from this is s 85% pure. An absorbtion spectrum was obtained by scanning the mono-S purified material from 400mM 
to 650mM with a Shimadzu spectrophotometer (Figure 24, top). The spectrum obtained was identical to that of human 
hemoglobin (Figure 24, bottom), indicating that the protein had folded and incorporated heme. The amount of hemoglobin 
recovered in the two peak fractions was determined from the extinction coefficient (1.23 x 10 4 A450/M) to be ~ 20mg. 
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Example 21 Construction of Vectors for the Expression of a- and ft-Globin from Separate Yeast Plasmids 

In addition to the development of a single yeast vector that carries both a- and p-globin expression cassettes we 
also developed a system mat uses separate plasmids for each of the twoglobin cDNA's. The two plasmids each carry 
5 two yeast g-?nes mat are used to maintain the plasmid in yeast. Both have the LEU2 gene in common and one (pGS4688) 
has me URA3 gene the ome* (pGS4988) has the TRP1 gene. By using a host that carries mutations in URA3, LEU2 
and TRPi ton piavnos (one with a-globin and the other with the p-globin expression cassette) can be maintained. 
The constructions of these vectors is described below. 

io CONSTR UCT QN OF pClT 

The pusn*d pOU (bug) was digested with BamHI and Sail and the largest fragment was purified by agarose gel 
electrophoresis "he piasmrf YRp7 (10ug) (J. Strathern, E. Jones, and J. Broach, The Molecular Biology of the Yeast 
Sacchatpfrytyy CofcJ Sc* tog Harbor, NY, 1 981 ) was digested with Bglll and Sail and the fragment containing the TRPI 

75 gene was puif»*d tr> *<*voi*t g«H electrophoresis. Twenty-five ng of gel-purified, BamHI and Sail digested pC1U was 
mixed with *Ot*g erf r>* Bg n Snii fragment in ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture 
was incubated a!^ 'Cto» 1 5 tv and a portion of the ligation reaction mixture used to transform E.coli DH5a. Tetracycline 
resistant co*onies e sheeted on LB-tetracycl ine medium. Plasmid DNA was prepared from 1 5 transformants, digested 
with EcoRi and analyzed t>y agarose gel electrophoresis. One isolate with me expected EcoR1 restriction fragments 

20 was chosen and <*as doc*gna?ed pClT (Figure 22(a)). 

CLONING TH E f* GLOB IN E XPRESSION CASSETTE IN pC1T 

One huno>#y1 rwnogiams of pClT was digested with Sail, phenol extracted, ethanol precipitated and dissolved in 
25 0.01ml of 10mM Tr* HCi (pH 7 8). 1mM EDTA. The plasmid pGS3888 was digested with Xhol and the ~ 1.2 kb Xhol 
fragment containing the ^-gioom expression cassette was purified by agarose gel electrophoresis. Ten nanograms of 
Sail digested pClT was mixed win 60ng of the Xhol fragment containing the p-gfobin expression cassette in 0.01ml of 
ligase buffer containing io unrts of T4 DNA ligase. The reaction mixture was incubated at 25°C for 30 min. A portion of 
this material was used to transtor m E.coli DH5cc One hundred ampicillin resistant transformants were picked and patched 
30 to LB-ampicillin agar They were incubated for 5 hr at 37°C and overlayed with a nitrocellulose filter. Plasmids containing 
the Xhol fragment were identrf ied by colony hybridization, using the Xhol fragment as a hybridization probe, as described 
above. Colonies producing an autoradiographic signal were used to prepare plasmid DNA. The purified DNA was 
digested with EcoR i and analyzed by agarose gel electrophoresis for the presence of the expected restriction fragments. 
Two clones containing the desired Xhol insert were identified and the orientation of the insert was determined by agarose 
35 gel analysis of EcoRi digests or Sphl digest of the plasmid DNA. Both isolates contained the desired insert in the same 
orientation and one was designated pGS4988 (Figure 22(a)). 

CLONING q-GLOBIN EXPRESSION CASSETTE IN pC1U 

40 The Xhol fragment from pGS4088 containing the a-globin expression cassette was purified by agarose gel elec- 
trophoresis and the recovered fragment dissolved (100ng/0.01ml) in imM Tris-HCI ph 7.8, 0.1mM EDTA. Twenty-five 
nanograms of the gel purified fragment was mixed with 10ng of Sail digested (phenol-chloroform extracted, ethanol 
precipitated) pC1 U in 0.01 ml ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 
25°C for 1 . 5 hr and a portion used to transform E.coli DH5a. Ampicillin resistant colonies were selected on LB-ampicillin 

45 plates. Plasmid DNA was prepared from 12 transformants, digested with Hindlll and analyzed by agarose gel electro- 
phoresis. The two possible orientations were isolated and given the designation pGS4488 (Figure 22(b)) and pGS4688 
(Figure 22(b)). 

CONSTRUCTION OF DIPLOID STRAINS EXPRESSING a- AND 6-GLOBIN CHAINS FROM SEPARATE PLASMIDS 

so 

S. cerevisiae RSY330 (MAa pep4-3 prb1-112 hist7 ura3-52 trpl-289 can1 gall) was transformed (Ito et al.) with 
pGS4488 or pGS4688. Trp + transformants were selected on SD-trp medium and streaked for single colonies. S. cerevi- 
siae BJY1991 (MATaprb1-112pep4-3 Ieu2-3,1 12 trp1- 101 ura3-52 gal2 can 1) was transformed with pGS4988. LIRA* 

transformants were selected on SD-ura medium and streaked for single colonies. These strains were each tested for 

55 the production of a- and p-globin as follows: (1) Single colonies were picked from SD-selective medium and used to 
inoculate 2ml of SD-selective (liquid) medium. The cultures were incubated for 24 hr at 30°C and diluted into 25ml of 
fresh SD-selective medium and incubated for an additional 24 hr. The cells were collected by centrifugation and resus- 
pended in 25ml of YP-galactose (2%) and the incubation continued for an additional 24 hr. The cells were harvested 
(8,000 x g, 10 min) and the pellet washed with 50ml 0.0 10M TrisHCI(pH 7.8), 0.001 M EDTA. The pellets were dissolved 
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by heating to 96°C for 10 min in SDS-PAGE sample buffer and the debris removed by centrifugation (15,000 x g, 10 
min). The cleared supernates from 1x10 6 cells each were analyzed by SDS-polyacrylamide gel electrophoresis and 
Western immunoblotting as described above, p-globin was readily detectable in extracts from BJY3505[pGS4988]. We 
were also able to detect a-globin cross-reacting material, although the signal strength was considerably weaker. The 

5 production of tetromeric hemoglobin requires the presence of both a and p chains, ideally these would be expressed in 
the same cell. Because strains BJY3505, BJY1991 and RSY330 are haploids they each can be mated with a yeast 
strain of the opposite mating type. Strains RSY330 and BJY1991 are both mating type a, whereas BJY3505 is mating 
type a. BJY3505[pGS4988], RSY330[4688] or BJY 1991 [4688] or RSY330[pGS4688) matings were done and diploids 
selected by streaking onto SD minimal medium with no additional amino acids or other nutrients. Neither of the plasmid- 

w bearing parental strains are capable of growth on this medium, diploids, however, can grow. Because the diploids are 
homozygous for the mutations in TRP1 and URA3 both plasmids must be present for the cells to grow in the absence 
of these nutrients. These diploid strains were analyzed for the synthesis of a and p-globin as described above. A most 
surprising result was obtained. Although a-globin and p-globin are synthesized at low levels in the haploid strains, co- 
expression in a diploid strain results in a substantial increase in the levels of both chains. Furthermore, after induction 

15 (24 hr) with galactose the cell pellets develop a distinct, pink-red color. These results suggest that: (a) co-expression of 
a-and p-globin stabilizes the two proteins, perhaps as a consequence of their interaction and (b) the protein is apparently 
folding and incorporating heme. 

Example 22: Oxygen Binding Properties of Yeast- Derived Hemoglobin 

20 

The P 50 of the yeast derived hemoglobin is nearly identical to that of Hgb Ao. Depending on how we measure it, 
the P 50 ranges from about 5 to 10 torr in a phosphate free solution. At 25°C it is about 4-6 torr in 50mM BisTris, pH 7.4, 
NaCI 0.1 M. In the same solution, at 37°C, the P 50 is from 8.5 to 1 1 . 

25 Example 23: Expression of Pi- Alpha Hemoglobin in S.Cerevisiae 

Methods 

Unless stated otherwise, all enzymes (restriction endonucleases, T4 DNA ligase, T4 DNA polymerase, T4 polynu- 

30 cleotide kinase) were purchased from New England Biolabs, Pharmacia, BRL, Stratagene or Boerhinger Mannheim. 
Restriction enzymes and T4 DNA ligase were used with the buffers supplied by the manufacturers. 

Ethanol precipitation of nucleic acids was carried out by the addition of 0.5 volumes of 7.5M ammonium acetate 
and 2 volumes of 20:1 ethanol-isopropanol. The pellet was collected by centrifugation at 14,000 x g for 15 minutes, 
washed twice with 80% ethanol, once with 95% ethanol and dried in vacuo. Phenol extractions were done by the addition 

35 of 50:49:1 mixture of phenol:ch!oroform:isoamyl alcohol. Phases were separated by centrifugation at 14,000 x g and 
the aqueous phase collected. Plasmid DNA was purified from E.coli DH5a as described by Birnborn and Doly (Nucleic 
Acids Research 1 979, 7:151 3-1 520). Electrophoretic analysis of DNA was carried out in agarose gels using tris-acetate 
electrophoresis buffer (Maniatis, et al. Molecular Cloning. Cold Springs Harbor, NY, 1 982). DNA was visualized by stain- 
ing the gels with 0.5fig/ml ethidium bromide and exposing the gels to ultraviolet light. DNA fragments were purified from 

40 agarose gels using a kit purchased from BIO-101 . DNA fragments were purified from acrylamide gels by crushing the 
excised gel fragment in 3.25M ammonium acetate and incubating overnight at 37°C. Gel fragments are removed by 
centrifugation (14,000 x g, 15min) and the DNA precipitated with ethanol. The precipitate is dissolved in TE (10mM Tris 
HCI, pH7.8, 1mM Na3 EDTA). Acrylamide gel electrophoresis of DNA was done as described by Maniatis, etal. (Molec- 
ular Cloning. Cold Spring Harbor Laboratory, NY, 1982). Bacteriological growth media and DNA transformation methods 

45 are as described by R.W. Davis, et al. (Advanced Bacterial Genetics. Cold Spring Harbor Laboratory, NY, 1 980). Media 
for the growth of S.cerevisiae has been described by F. Sherman £i aL (Methods in Yeast Genetics: A Laboratory Manual. 
Cold Spring Harbor Laboratory, NY, 1979). Transformation of S.cerevisiae with linear or circular DNA was carried out as 
described by H. tto, et al. (J. Bacteriology, 153:163-168(1983)). 

Removal of the Pstl and Spel sites from pGS4888 . The design of the synthetic linker for joining two a-globin chains 

so allows the inclusion of Pstl and Spel sites flanking a 30bp sequence that includes the junction of the two a-globin coding 
sequences. Because we anticipate testing several different linker sequences, these sites will allow directional cloning 
of relatively short synthetic oligonucleotides encoding different linker sequences. Removal of the Pstl and Spel sites 
from the vector sequence is, therefore, necessary so that the sites in the coding region are usable. One fig of the plasmid 
pGS4888 was digested with Pstl and ethanol precipitated. The dry pellet was resuspended in 50|x1 of 33mM Tris-acetate. 

55 pH7.9, 66mM potassium acetate. 10mM magnesium acetate, 0.5mM DTT and 50 juiM of each dNTP (T4 polymerase 
buffer). Two units of T4 DNA polymerase were added and the reaction mixture incubated for 15min at 37°C. Na3 EDTA 
was added to 12.5mM and the reaction mixture heated to 65°C for 15min, phenol extracted and ethanol precipitated. 
The dry pellet was dissolved in 1 4 julI of T4 DNA ligase buffer (BRL) and 1 jil (10 units) of DNA ligase added. The ligation 
mixture was incubated a 4°C for 16hr. A portion of the ligation reaction was used to transform E.coli DH5a and trans- 
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formants were selected on LB-ampicillin plates. Plasmid DNA was prepared from 1 2 transfer ma nts. The DNA was ana- 
lyzed by agarose gel electrophoresis of Pstl digests. Five transformants had lost the Pstl site and one of these was 
designated pGSl889. The Spel site of this plasmid was removed as described above after digestion of pGSl889 with 
Spel. A plasmid was identified that had lost both the Pstl and the Spel site and was designated pGSl989. 
5 Strategy for Joining Two Copies of the g-globin cDNA. A fragment containing the 5'-363bp of the a-globin coding region 
can be excised from pGS4888 as an Ncol to ApaL1 fragment A second fragment containing nucleotides 109 through 
the 3 untranslated region can be removed as a Fokl to Sail fragment. These two fragments can then be joined together 
to create a single translation unit encoding two tandem copies the of a-chain cDN A by using a synthetic oligonucleotide 
adaptor (Table 8). The purification and assembly of these fragments is described below. 

io The plasmid pGS4888 was sequentially digested with ApaLI and Ncol and a ~365bp fragment (a-chain 1) purified 
by acrylamide gel electrophoresis. A second fragment of ~351bp (a-chain 2) was prepared by sequential digest with 
Fokl and Sail followed by acrylamide gel electrophoresis. Four oligonucleotides were synthesized (Table 8) and assem- 
bled into a I73bp linker (RGGV) containing an ApaLI and Fokl ends. Synthesis, purification and assembly of oligonu- 
cleotides was as described previously. This adaptor encodes an amino acid bridge linking the carboxy-terminus of a- 

is chaiM with the amino-terminus of a-chain2 as well as portions of the 3' end of a-chain1 and the 5' end of a-chain2. The 
carboxy terminal arginine residue of a-chain1 is separated from the amino terminal valine of a-chain2 in this construct 
by 2 glycine residues. The "diglycine bridge" portion is flanked by Hpal, Spel and Pstl sites. These sites allow the sub- 
stitution of a variety of bridges by the use of ~30bp adaptors to connect the two a-chains into an expression cassette 
was carried out in a four part ligation as described below. 

20 Plasmid pGS1989 was sequentially digested with Ncol and Sail and the large fragment containing the plasmid 

vector and pGALGAP was purified by agarose gel electrophoresis (gp1989). Fifty nanograms of gp1989 were mixed 
with 200ng each of the gel purified ApaLI -Ncol a-chain1 fragment and the Fokl-Sall a-chain2 fragment A twenty-fold 
molar excess of the synthetic ApaLI -Fokl adaptor was added (the S'-ends of the adaptor segment are not phosphor- 
ylated). The ligation reaction was carried out in a volume of 20pl for 4hr at 23°C. A portion of this reaction mixture was 

25 used to transform E. coli DH5a and ampicillin resistant colonies were selected. Transformants were patched to nitrocel- 
lulose filters and screened by hybridization with the ^P-labelled oligonucleotide AL2-as (Table 8). Five pMoles of oligo 
AL2-as were incubated in 20|xl of a solution containing 2 units of T4 polynucleotide kinase. 50mM Tris-HCI (pH7.6). 
10mM MgCI 2 .6mM DTT, 0.1mM spermidine, 0.1 mM EDTA, and 10pMoles of y^P-ATP (7000 Ci/mMole) for 2hr at 37°C. 
Filters were processed as described by Maniatis, et at. and the hybridization was done at 37°C in 5XSSC, 50% forma- 

30 mide, 1 OOjig/ml yeast tRNA and 2% SDS for 1 6hr. The filters were washed, sequentially, in 2XSSC and 1 XSSC at 55°C 
for 15min (twice in each, all wash solutions contained 1% SDS). Dried filters were exposed to X-ray film and colonies 
giving a hybridization signal were used to prepare plasmid DNA. Plasmid DNA was analyzed by restriction enzyme 
digestion to identify plasmids that contained inserts of a size consistent with two a-chains (Ncol-Sall digest) and that 
contained unique Pstl, Spel and Hpal sites. A plasmid identified in this manner was designated pGS21 89 (Figure 25). 

35 An Xhol fragment from pGS3888 containing the p-globin expression cassette was purified by agarose gel electro- 
phoresis. Xhol digested pGS2189 (50ng) was combined with 150ng of the gel purified insert from pGS3888 in 1|iil of 
ligation buffer containing 10 units of T4 DNA ligase. A portion of this mixture was used to transform E.coli DH5a and 
ampicillin resistant colonies were selected. Plasmid DNA was isolated and analyzed by digestion with Xhol, BamHI or 
Ncol. Several plasmids were identified that produced restriction fragments of the expected sizes, all contained inserts 

40 in the orientation shown in Figure 25. One of these was designated pGS2989. Although this plasmid contains the linked 
a-globin genes and a p-globin gene under the control of separate promoters, it is not capable of replication in S.cerevisiae. 
The entire expression cassette, containing the two genes (dia and p-globin), can be purified as a Notl fragment Ten jmg 
of pGS2989 was digested with Pvul and Notl and the Notl fragment gel purified. The digestion with Pvul was done to 
reduce the size of the vector sequences which otherwise would comgrate with the desired Notl fragment Two hundred 

45 ng of the gel purified Notl fragment was combined with 50ng of Notl digested pC1 N in 1 OjiL of ligation buffer containing 
10 units of T4 DNA ligase. The reaction mixture was incubated at 4°C overnight and a portion was used to transform 
E.coii DHSct. Ampicillin resistant transformants were selected and plasmid DNA prepared. DNA was digested with Ncol- 
Sall, Pstl or Notl to identify plasmids with the dia, p-globin expression cassette and to determine the orientation of the 
inserted fragment. Several plasmids were identified that contained the correct insert, all of which have the inserted 

so fragment in the same orientation. One of these was designated pGS3089 (Figure 25). This plasmid was used to transform 
strains GSY1 12 and RSY334. 

Expression and Purification 

55 S.cerevisiae strains GSY1 1 2[pGS3089] and RSY334[pGS3089] were grown to saturation in SD-uracil medium and 
diluted into 2L of YPD medium (all cultures were incubated at 30°C). Twenty-four hours after inoculation (of the YPD 
culture), galactose was added to 1 % and the cultures incubated for another 24hr. Carbon monoxide was bubbled through 
the culture and the cells collected by centrifugation. A 1 :1 mixture (weightvol) of cells and breakage buffer (1 OmM sodium 
phosphate (pH7.2), 1mM EDTA, 1mM EGTA and 5mM DTT) were disrupted in a "Bead-Beater" (Biospec Products, 
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Bartlesville, OK). Debris was removed by centrifugation (10,000 x g. 30min), The soluble fraction was adjusted to pH6.0 
with phosphoric acid and chromatographed on a column of S-sepharose (fast flow) equilibrated in 10mM sodium phos- 
phate pH6.0. The loaded column was washed with 1 0mM Tris-HCI, pH6. 7 and hemoglobin eluted by washing with 20rnM 
Tris-HCI, pH7.5. A bright red band was collected and the pH adjusted to 8.0, by the addition of NaOH. This material was 
then chromatographed on Q-sepharose (fast flow) equilibrated in 20mM Tris-HCI, pH8.0. Hemoglobin was eluted with 
a NaCI gradient (0-0.4M). A final chromatography step was carried out on Sephacryl S-200 equilibrated in 5mM NaP0 4 , 
pH7.4, 0.1 M NaCI. Each step of the purification protocol was analyzed by SDS-polyacrylamide gel electrophoresis (Lae- 
mmli, U.K.. 1970, Nature 22Z:680-685) and staining with Commassie brilliant blue. The purified protein contains a band 
that comigrates with monomer p-globin and a band in the expected position for a-globin dimer. This material comigrates 
with human tetrameric hemoglobin when analyzed by size exclusion chromatography (Progel TSK G3000 SWXL HPLC 
column). This protein is red and binds O z with a 50% binding affinity (P50) of 8-1 0 Torr, indicating that it has incorporated 
heme and is capable of reversible 0 2 binding. 

The purified protein was separated into a- and p-chains by reverse phase HPLC and the sequence of the 1 0 amino- 
terminal residues of each chain were determined. The sequence matched that of bona fide human hemoglobin, indicating 
that the initiating methionine had been efficiently removed from both the a-globin dimer and the p-globin dimer. 

Example 24: Construction of Low Affinity. Genetically Cross-Linked. Hemoglobin Mutants and Expression in Yeast 

Constructi on of vector for site directed mutagenesis. The Xhol fragment containing the p-globin gene and its GAL- 
GAP promoter waas isolated from pGS2989 by preparative agarose gelelectrophoresis and ligated with Xhol digested 
Phagescript (RF-form, obtained from Stratagene, Inc.). E. coli XL1 -Blue was transformed with the DNA ligation mixture 
and phage containinginserts were identified (white plaques on medium containing XGAL). Single plaques were isolated 
and DNA was prepared and analyzed by digestion with Xhol and agarose gel electrophoresis. This construct was des- 
ignated Mpp-globin. 

Preparatio n of single stranded template . A saturated culture of E. coli XL1-Blue (available commercially from Strat- 
agene. 11099 North Torrey Pines Road, LaJolla, CA 92037) (200 was used to inoculate 4 ml of 2X YT broth. This 
culture was incubated for 2 hr. at 37°C and then infected with a single M13p-globin phage plaque and the incubation 
continued for 6-8 hrs. Cells were removed by centrifugation and discarded. Phage were precipitated from 1.6ml of clarified 
medium by the addition of 320 \i\ of cold 30% PEG 8000 in 3.5M ammonium acetate followed by incubation on ice for 
30 min. Phage were collected by centrifugation and resuspended in 0.1 ml of 10mM Tris-HCI pH 8.0, 1mM EDTA (TE). 
DNA was isolated by extracting twice with phenol/cholorform. DNA (contained in the aqueous phase) was precipitated 
by the addition of NaCI to 0.5M and two volumes of 95% ethanol. The DNA pellet is dissolved in 20\i\ of water. 

In vitro mu tagenesis reactions . Two hundred ng of template DNA are mixed with a twenty-fold molar excess of the 
appropriate, phosphorylated mutagenic oligonucleotide in 10 julI of 20mM Tris-HCI, pH 7.4, 2mM MgCI 2( 50mM NaCi and 
heated to 70 °C for 5 minutes. The annealing reaction is allowed to slowly cool (40 min) to 40°C and then to 30°C (15 
min). After the last annealing step the mixture is transferred to ice for 5 min. To this mixture 1 \i\ of synthesis buffer (4mM 
each dNTP, 7.5mM ATP, 175mM Tris-HCI pH 7.4, 37.5mM MgCI 2 215mM DTT), 0.5 \i\ of T4 gene32 protein (2 \igf\i\). 

of T4 DNA ligase (3 units), and 1 ml of T4 DNA polymerase (1 unit) are added. The mixture is incubated at 37°C for 
90 min (following 5 min at room temperature). The reaction is stopped by the addition of 90ml of 0.1 M Tris-HCI (pH 8.0) 
and 0. 1 M EDTA. Approximately 0 . 1 \i\ of the reaction mix was used to transf ect E. coli XL1 - Blue cells (Cells were prepared 
for transformation by the method of D. Hanahan, 1983. J. Mol. Biol. 166:557). 

Screening for phage containing the mutant aene. Approximately 50-1 00 plaques were picked to fresh plates seeded 
with the appropriate host strain (XL1 -Blue) in ordered arrays. After incubation for 6-8 hrs. at 37°C the plates were overlaid 
with nitrocellulosef ilters and prepared for hybridization essentially as described in Davis, R.W. et al. (Advanced Bacterial 
Genetics: A Manual of Genetic Engineering. Cold Spring Harbor Laboratory, New York 1 980). The choice of hybridization 
temperature with the mutagenic oligonucleotide was determined on the basis of the nucleotide composition of the oli- 
gonucleotide. The oligonucleotide was labelled with y^P-ATP and polynucleotide kinase (T Maniatis et al. Molecular 
Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory, 1982). Hybridizations were done at 2-5°C below the 
calculated Tm for the correct match in 6XSSC, 2%SDS, 100*ig/ml yeast tRNA using 105 cpm/ml of the labelled oligo- 
nucleotide for >6 hr. The Tm was calculated using the formula: 4°C for every GC pair + 2°C for each AT pair assuming 
an Na+ concentration of 1 M. Filters were washed at the same temperature as the hybridization in 5XSSC, 2% SDS and 
exposed to XRay film. Plaques giving positive hybridization signals were used to prepare single stranded DNA, as 
described above. The single-stranded DNA was used as template for sequencing reactions (Sanger, F. and Coulson, 
A.R. 1975, J. Mol. Biol. 94:441) to confirm that the mutant sequence was indeed present. — 

Oligonucleotides used for mutagenesis. 
PN1 OSK-Tresbyterian" 5'-AGGCTCCTGGGCAAGGTGCTGGTCTGT-3* 
pE90K-"Agenogi" 5'-GCCACACTGAGTAAGCTGCACTGTGAC-3* 
pV67l-(No Alias) S'-CATGGCAAGAAAATCCTCGGTGCCTTT-S' 
PN1 02T-*Kansas" 5 , -GTGGATCCTGAGACTTTCAGGCTCCTG-3 , 
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Cloning into yeast expression vectors . Phage RF DNA was prepared from phage-infected cells that had been 
confirmed to have the mutant sequence and the Xho! fragment containing the altered p-globin gene was purified by 
agarose gel electrophoresis. This fragment was cloned into a derivative of pGS3089 that had been altered to change 
the di-cc-linker from the diGly configuration to a single glycine bridge, and from which the p-globin gene had been deleted, 

5 and designated pGS3889desp (Figure 26). This created a unique Xhol site into which altered p-globin expression cas- 
settes could be inserted, allowing coexpression with the a-globin dimer. The expression plasmids generated in this way 
(GS3889 for "Presbyterian'*, pGS5189 for "Agenogi", pGS5689 for "Kansas'" and pGS4989 for "pV67l", all identical to 
pGS3089 except for mutation in specified codon) were used to transform S. cerevisiae strains GSY1 12 and RSY334. 
Characteristics of a genetically fused low affinity mutant protein expressed in veast . Cells bearing the plasmid 

io pGS3889 (single gly bridge, PN108K alias "Presbyterian") were grown and hemoglobin purified as previously described. 
This material when analyzed for functionality was substantially "right-shifted" compared to the fused protein with a wild 
type p-chain (P 50 =23-25 for the mutant with N = 2.5). 


75 


Example 25: Effect of Choice of Strain and Induction Temperature on Expression of Pi-Alpha Hemoglobin in E. coli 


Table 100 contains comparisons of di-alpha/beta fermentations. It also provides comparison of temperatures of 
induction. The column labeled "mg di-A+B" is total mg di-alpha and beta polypeptides per fermentation. The adjacent 
column "mg/OD-L" simply expresses the first column number on a cell density basis. The two columns labeled RHGB 
present total and cell density-corrected output of functional recombinant hemoglobin. The last column shows that in 
20 terms of final functional hemoglobin recovery, strain JM109 is preferable. Without binding ourselves to any theory, we 
believe that this difference has to do with the proteases expressed in different strains. It is interesting to note that of the 
two best JM109 runs, one induction was at 30° and one at 37° with roughly equivalent amounts of final functional Hb 
produced. 

25 Example 26: Construction of Genetically fused a-Globin Dimers Connected By a Single Glycine or Proline Residue 

The following synthetic adaptors for altering the dict-globin bridge were synthesized and purified as described in 
previous sections. 

30 TSKYRPVLSPA 

5' -A ACT AGT AAG TAC AG A CCT GTT TTG TCT CCT GCA-3 RPV 
3'-T TGA TCA TTC ATG TCT GGA CAA AAC AGA GG-5 ' 

35 Hpal PstI 

TSKYRGVLS P A 
40 5 1 -A ACT AGT AAG TAC AG A GGT GTT TTG TCT CCT GCA-3 1 RGV 

3 f -T TGA TCA TTC ATG TCT CCA CAA AAC AGA GG-5 * 

Hpal Pst I 

45 

Complementary pairs of RPV and RGV oligonucleotides were combined (2.4jig of each oligonucleotide) in 0.05 
ml of water; The two pairs of adaptors were precipitated (separately) by the addition of 2.jil of 4M NaCI and 0.158mL of 
1 00% ethanol followed by centrif ugation. The pellets were washed with 80% and 1 00% ethanol and dried. The adaptors 
so were dissolved in 24|il of TE. 

Cloning of t he RBGV and RPV adaptors . The plasmid of pGS2989 was sequentially digested with the enzymes 
Hpal and PstI and the vector was purified after agarose gel electrophoresis. The digested plasmid was then ligated with 
a 10-fold molar excess of either the RGV or RPV adaptor (these adaptors were not phosphorylated) as previously 
- described. A portion of the ligation mixture was used to transform E.coli DH5. Transformants were selected on LB plates 
55 containing ampicillin. Clones containing the new adaptor were identified by colony filter hybridization. The hybridization 
probes were either the RPV or RGV upper strands (as shown above). These proves were labelled with y32P-ATP and 
T4 polynucleotide kinase. Filters were hybridized with the appropriate probes (10 5 cpm/ml) at 37°C in a solution con- 
taining 6XSSC. 50%formamide, 2% SDS and 150|ag/mi yeast tRNA. Filters were incubated for >12 hrs. and washed 4 
times with 2XSSC, 2%SDS (250ml, 20min each) and exposed to X-ray film. Colonies that produced autoradiographic 
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signals were used to prepare plasmid DNA which was sequenced using the following primer: S'-AAGCTTCAGCACCG- 
TATTCA-3' (a 2 seql). This primer was specifically designed to minimize homology with the corresponding sequence in 
the a1 subunit of the dimer and to maximize homology with sequences near the 5' end of the a2 subunit of the diet dimer. 
This allows sequence to be determined reading from the a region through the sequence that bridges the two a-domains 

5 without a background sequence reading from a similar sequence near the 5'-end of the a1 -domain. The plasmids con- 
structed in this way were designated PGS2989RPV(single proline linker) or pGS2989RGV (single glycine linker). 

Cloning into veast expression plasmids . The Notl fragments from either pGS2989RPV or pGS2989RGV that contain 
the hemoglobin expression cassette were purified by agarose gel electrophoresis and subcloned into pC1 N that had 
been digested with Notl. Plasmid DNA from ampicillin resistant transformants was isolated and analyzed by digestion 

to with Notl and agarose gel electrophoresis. These plasmids were designated pGS3089RPV or pGS3089RGV. A second 
set of plasmids was generated from these to facilitate the substitution of different p-chain mutants. These plasmids were 
generated by digestion with Xhol and religation under dilute conditions <1fig/ml). This favors deletion of the p-chain 
expression cassette. Plasmid DNA was isolated from ampicillin resistant transformants and the structures confirmed by 
digestion with Notl and Xhol (these plasmids have been designated pGS3089RGV-desp, see Figure 26, and 

is pGS3089RPV<iesp). 

Reference Example: Reconstitution of Recombinant Aloha-Globin and Recombinant Beta-Globin with Heme and chem- 
ical Reduction to Yield Artificial Hemoglobin 

20 Conventional methods of preparing artificial hemoglobin are exemplified by the following procedure. 

The lyophilized recombinant alpha and beta-globins (100 mg each) were individually dissolved in 8M urea/50mM 
Tris-Cl , pH 8.01/1 mM EDTA/ 1mM DTT, diluted to a concentration of and incubated at room temperature for 3-4 hours. 
The alpha-globin was then diluted to 0.3mg/ml with chilled 20mM K 2 HP0 4l pH 5.7/1 mM EDTA/1 mM DTT. Hemin (25 
mg) was dissolved in 2.4mg 0.1 M KOH, diluted with an equal volume of 1 M KCN; this solution was then made 0.1 mg/ml 

25 in hemin and 20mM K 2 HP0 4 , pH 6.7 with stock phosphate buffer. Hemin from this solution was added to a 2.8 molar 
excess to the chilled alpha-globin; and equal molar amount of beta-globin was added and the solution was dialyzed at 
4°C overnight against 0.1 M K 2 HP0 4 , pH 7.6/1 mM EDTA/1 mM KCN. The artificial Hb solution was concentrated by ultra- 
filtration using a PM- 1 0 membrane ( Amicon) and transferred into a 200ml screw-top test tube with a rubber septum. The 
hemoglobin solution was deoxygenated by evacuation and flushing with N 2 , and then the solution was saturated with 

30 CO. 100mM sodium dithionite solution was prepared anaerobically in a 20ml screw-top test tube with rubber septum. 
4.5 equivalents of dithionite were added to the Hb solution with a syringe, and the mixture incubated on ice for 15 min. 
The Hb solution was gel-filtered against 1 0mM Na phosphate buffer pH 6.0 on a 4x40 cm Sephadex G-25 (fine) column. 
The colored solution was then applied to a 2x10 cm-52 (Whatman) column equilibrated with the same buffer and the 
chromatography was developed with a linear gradient of 500ml lOmM Na phosphate buffer pH 6.0 and 500ml of 70mM 

35 sodium phosphate buffer pH 6.9. CO was removed from Hb by photolysis under a stream of oxygen. Artificial Hgb 
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prepared this way is isolated in only about 25% yield from the fusion peptides but shows native oxygen binding properties. 
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Tat? l g 2; gllqgnyciyptide S»qugnc«« n S ed to create rn- ft 1rrn 

Globm Genes ' 

a* (Arg)-Gly-Met:(L«u>a 2 Linker 

BstiBl Eagl 

CG AAA TACCGTGGTATGCTGTCTCC 

TTTATGGCACCATACGACAGAGGCCGG 

a* (Arg)-Gly-Gly-(Val)a 2 Linker 


« CGAAATACCGTGGTGGTGTTCTGTCTCC 

TTTATGGCACCACCACAAGACAGAGGCCGG 

a 1 (Arg)-Gly-(Val)a 2 Linker 

20 BstBl Eagl 

CGAAATACCGTGGTGTTCTGTCTCC 

TTTATGGCACCACAAGACAGAGGCCGG 

a 1 (Arg)-Gly-Gly-(Leu)a 2 

BstBl ^ x 

CG AAA TACCGTGGTGGTCTGTCTCC 

TTTATGGCACCACCAGACAGAGGCCGG 


a' (Arg)-(Val)a 


CGAAATACCGTGTTCTGTCTCC 

TTTATGGCACAAGACAGAGGCCGG 
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Table 4 Pi* Values for Hemoglobin-Like P roteins 

Hemoglobin P T ft 

des-val Hqb 10.2 

dialpha ( arg-gly-net) Hgb 9.0 
dialpha 

b#ta 4f val — >ile Hgb 16.0 

dialpha 

beta 4 7 val — >ile/ 

b«ta* 2 lys — >arg Hgb 16.0 

dialpha ( arg-gly-gly-val ) Hgb 16.0 


Table 5. Synthetic Oligonucleotides Used For The Synthesis of 


1 TCCACTGAAA XAAAAGGTTT AAACCAGTTC CCTGAAATTA TTCCCCTACT 
TGACTAATAA GTATATAAAG 

2 CAATACCTAC CGTTTATATA CTTATTAGTC AAGTAGGGGA ATAATTTCAG 
GGAACTGGTT TAAACCTTTT TTTTCAG 

3 ACGGTACGTA TTGATTGTAA TTCTGTAAAT CTATTTCTTA AACTTCTTGA 
ATTCTACTTT TATAGTTAGT CTTTTTTTTA GTTTT 

4 AAGTTCTtCG TGTTTTAAAA CTAAAAAAAA GACTAACTAT AAAAGTAGAA 
AGAAGTTTAA GAAATAGATT TACAGAATTA CAAT 

5 AAAACACCAA GAACTTAGTT TCGAATAAAC ACACATAAAT AAACCATGGT 
TAACT 

6 CTAGAGTTAA CCATGGTTTA TTTATGTGTG TTTATTCGAA ACT 


Table 6. Synthetic Oligon ucleotides Used For The S ynthesis 

Of The Galactose Upstream Activator 

1 CGTACGGATTAGAAGCCGCCGAGCGGGTGACAGCCCTCCGAAGGAAGACTC 

TCCTCCGTGCGTCCTCGTC TTCACCGGTCGC 

2 AGGACGCACGGAGGAGAGTCTTCCTTCGGAGGGCTGTCACCCGCTCGGGG 

CTTCTAATCCGTACGCATG 

3 GTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAAT 

AAAGATTCTACAATACTAGCTTTT ATGGTTATGAAGAGGAAAAT 

4 ATAACCATAAAAGCTAGTATTGTAGAATCTTTATTGTTCGGAGCAGTGCG 

GCGCGAGGCACATCTGCGTT TCAGGAACGCGACCGGTGAAGAC 

5 TGGCAGTAACCTGGCCCCACAAACCTCAAATGAACGAAATCAAATTA 

ACAACCAGATATC 


6 


TCGAGATATCTGGTTGTTAATTTGATTCGTTCATTTGAGGTTTGTGG 
GGCCAGGTTACTGCCAATTTTCCTCTTC 
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Table 7 


w 


15 


20 


25 


30 


35 


Codon Preferences in Yeast 

Ala 

GCU, GCC 

Ser 

UCU, UCC 

Thr 

ACU, ACC 

Val 

GUU, GUC 

lie 

AUU, AUC 

Asp 

GAC 

Phe 

UUC 

Tyr 

UAC 

Cys 

UGU 

Asn 

AAC 

His 

CAC 

Arg 

AGA 

Glu 

GAA 

Leu 

UUG 

Lys 

AAG 

Gly 

GGU 

Gin 

CAA 

Pro 

CCA 

Met 

AUG (No alternative codons) 

Trp 

UGG (ditto) 

Source: Bennetzen and Hall, J. Biol. Chem., 257:3026-31 (1982). 
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Table 8 

OLIGONUCLEOTIDES ITS ED FOR T HE CONSTRUCTION OF A GENE 
ENCODING A TANDEM AT.PHA-GLOBTN DTMER . 

AL-1SS 

5 1 -tgcacgcttctttggacaagttcutggcttctgtttctactgtgttaactagtaagt 
acagaggtggtgttttgtctcctgcagacaagactaac-3 * 

AL-2SS 

5 '-gttaaggctgcttggggtaa 
ggaaaggatgttcttgtct-3 * 

AL-1AS 

5 •-tgcaggagacaaaacaccacctctgtacttactagttaacacagtagaaacagaagcca 
agaacttgtccaaagaagcg-3 1 
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AL-2AS 

5 • -ggaaagacaagaacatcctttccaaagcttcagcaccgtattcacccagcgtgagcacc 
aaccttaccccaagcagccttaacgttagtcttgtc-3 1 

Note: NcoI-ALPHA-l-ApaLl; FoKl -ALPHA- 2 -Sail ; ApaLI-RGGV-Fokl 

lablfi 2J OXVqen Affinity of Recombinant Mu tant Hotno^frjp 

generated From FX-Hab an d Trvptic Digestion 

Oxygen affinity measured at 37 *C in 50 mJ3 Bis-Tris, 
pH 7.4, ' o.l il NaCl on a Hemox-Analyzer. Solutions were 60 ujj 
in heme and measured between 130 and 1.2 torr oxygen tension. 


HgB A D 9.5 

r HgB A 0 9.2 

HgB Providence 10.2 

Hgb Kansas 11.3 

HgB (beta 67 val — >ile) 22.4 


The value for Hgb A 0 is of course for free hemoglobin 
in solution. The P 5 0 of whole blood is much higher. 


Table 10 


Effects of NaCI and Inositol Hexaphosphate on Oxygen Binding to Hemoglobin Aq and Recombinant des-Fx Hgb 


Pso 

Pso 


0.1 M NaCI 

0 M NaCI 

2.2 mM IHP 

0 M 1HP 

HgbAo 

6.6 

2.8 

51.1 

6.6 

des-Fx Hgb 

4.9 

3.9 

5.5 

4.9 


Table 1 1 


Distribution of FX-Alpha, FX-Beta and FX-Hgb in E. coli. 


milligrams of protein per OD-lrter of E. coli 


soluble 

insoluble 

FX-Alpha 

36 

0 

FX-Beta 

21 

21 

FX-Hgb 

188 

0 
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TABLE 100 


15 


25 


35 


FERM DATA 

RFE 

STRAIN 

INDUCT TEMP 

FINAL OD 

MGDI-A+B 

MG/OD-L 

MG RHGB 

1 

BL21 

30.0 

6.6 



0.0 

5 

BL21HA 

37.0 

9.3 



0.0 

6 

JM109 

37.0 

7.1 

234.0 

16.5 

5.5 

7 

JM109 

37.0 

11.3 

372.0 

16.5 

25.0 

8 

JM109 

37.0 

6.6 

442.0 

33.5 

49.0 

10 

SCSI 

37.0 

15.2 

754.0 

24.8 

38.0 

11 

JM109 

25.0 

11.5 



26.0 

12 

JM109 

25.0 

9.5 



31.0 

13 

JM109 

30.0 

9.0 



64.0 . 

17 

JM109 

37.0 

14.4 

- 


51.2 

25 

JM110 

30.0 

20.5 



25.0 

26 

LE392 

37.0 

23.0 

1 040.0 

22.6 

4.1 

27 

JM110 

30.0 

27.0 

334.0 

6.2 

47.0 

30 

JM110 

37.0 

23.6 



16.0 

31 

23722 

30.0 

25.4 


>• 

59.0 

35 

W3110 

30.0 

18.6 



13.0 

38 

AG-1 

30.0 

13.0 



57.0 

49 

DH-1 

30.0 

4.0 



7.7 

50 

NM554 

30.0 

14.0 



56.0 

51 

NM554 

37.0 

14.5 



18.0 
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TABLE 2 00: Bacterial and Yeast Vectors 

DEFINITIONS: ROP, Gene which regulates plasid copy number 

ROP+-Low copy number 
rop— High copy number 

?S' ^i™ 1 ^ resist * nc * *sed for plasmid selection 
selection tetracycline resistance used for plasmid 

35'* tG «5 aC r^ ine sens **ive, TR gene not functional 
FX-A, FX-alpha gene 

FX-B, FX-beta gene 

DFX— A , Des-FX alpha globin gene 

DFX-B, Des-FX beta globin gene 

DV-a, Des-Val alpha globin gene 

DV-B, Des-Val beta globin gene 

RV-Di-alpha f Di-alpha gene containing no amino acid 
spacer (R=Argxnine; V-Valine) 

RGV-Di-alpha, Di-alpha gene containing single glycine 

(G) linker followed by a valine 
RGM-Di-alpha, Di-alpha gene containing single glycine 

linker followed by a methionine (M) 

linker i " alPha ' Di " alpha gene containing two glycine 
glycine li!£^~ Di " alEba ' Di - al P ha containing three 

LACI, gene encoding repressor protein which regulates 
TAG promoter 

I*AC+-repressor gene on plasmid 
LAC*""© reporessor gene on plasmid 


All bacterial plasmids listed below which contain alpha, di-alpha and/or beta genes also have translation^ couplers. 
The pPL expression system translationally couples the lambda N protein gene to a globin gene. 

pKK223-3 

Parental plasmid obtained commercially from Pharmacia LKB, 800 Centennial Ave., P.O. Box 1327 Piscataway, N.J. 
08855-1 327; all remaining plasmids derived from this parental plasmid. Has TAC promoter followed by a poly-restriction 
site region to facilitate gene insertion. 
AR, TS, ROP+, LAC- 

1. pDLII-62m 
pKK223-3 containing FX-A 
AR, TS, ROP+, LAC- 

2. pDL N-lOa 

pKK223-2 containing FX-B 
AR, TS, ROP+. LAC- 

3. PDLII-66A 

— Parental plasmids are 1 and 2; contains both FX-A and FX-B in single operon - 
AR, TS. ROP+, LAC- 

4. pGEM FX-A 

Parental plasmids are 1 and pGeml which is commerically availablef rom Promega Corporation, 2800 Woods Hollow 
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Rd., Madison, Wl 53711. pGeml containing FX-A 
AR 

5. pGEM FX-B 

5 Parentals are 2 and pGeml 

pGeml containing FX-B 
AR 

6. pDLII-83a 

io Parental is 4; contains DFX-A 

7. pDLIII-6f 

Parental is 5; contains DFX-B 
AR 

8. pDL II-86C 

Parentals are pKK223-3 and 6 
contains DFX-A 
AR, TS, ROP+, LAC- 

20 

9. pDLIII-13e 
Parentals are 7 and 8 

pKK223-3 containing both DFX-A and DFX-B 
AR, TS, ROP+, LAC- 

25 

10. pDL ll-91f 

Parental is 4 contains DV-A 
AR 

30 11.pDLII-95a 

Parental is 7 contains DV-B 
AR 

12. pDLIII-1a 

35 Parentals are pKK223-3 and 1 0 
contains DV-A 
AR, TS, ROP+, LAC- 

13. pDLIII-14c 

40 Parentals are 1 1 and 12 
contains DV-A and DV-B 
AR, TS, ROP+, LAC- 

14. pDL lll-47a 
45 Parental is 1 3 

contains RGM-DIALPHA and DV-B 
AR, TS, ROP+, LAC- 

15. pDLIII-82A 
so Parental is 13 

contains RGGV-DIALPHA and DV-B 
AR, TS, ROP+, LAC- 

- 16. pDLIV-8a - 

55 Parental is 13 

contains RGV-DIALPHA and DV-B 
AR, TS, ROP+, LAC- 
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17. pDLIV-47b 
Parental is 1 3 

contains RV-DIALPHA and DV-B 
AR, TS, ROP+, LAC- 

5 

18. pDLIV-66a 
Parental is 13 

contains RGGGV-DIALPHA and DV-B 
AR, TS, ROP+, LAC- 

10 

19. pDLIV-3a 
Parental is 15 

ROP gene is inactivated by insertion of a Not I linker into the Pvu II site within the ROP gene 
AR, TS, ROP-, LAC- 

15 

20. pDL IV-38a 
Parental is 1 5 

contains the Nagai mutation in DV-B 
AR, TS, ROP+, LAC- 

20 

21 . pDL IV-58f 
Parental is 20 

ROP gene inactivated as in 19 
AR, TS, ROP-, LAC- 

25 

22. pDL IV-59a 

Parental is 21 and pBR322 which is commerically available from a number of different suppliers, 
contains a functional TR gene constructed in the following manner: 

The Eco R1 site of pBR322 was changed to a Bam H1 site by insertion of a Bam H1 linker. This permitted removal 
30 of the 5' end of the TR gene from pBR322 as a Bam H1 fragment. This fragment was then inserted into the Bam 
H1 site located at the junction of the TAC promoter and the inactive TR gene of pKK223-3. Insertion of this fragment 
reactivates the TR gene if the fragment inserts in the proper orientation. Selection of colonies on tetracycline plates 
assures presence of the fragment in the proper orientation. 
AR, TR, ROP-, LAC- 

35 

23. pJR V-83a 
Parental is 11 

contains DV-B with the Presbyterian mutation (asnl 08 to lys). 
AR, TS, ROP+, LAC- 

40 

24. pJRVI-29a 
Parentals are 1 5 and 23 

contains RGGV-Di ALPHA and DV-B with Presbyterian mutation 
AR, TS, ROP-, LAC- 

45' 

25. pJR Vl-53b 
Parental is 24 

made TR by insertion of BAM H1 fragment 
AR, TR, ROP+. LAC- 

50 

26. pJR Vl-61a 
Parental is 25 

made ROP- by insertion of Not I linker into Pvu II site 
AR, TR, ROP-, LAC- - - 

55 

27. pDL V-4a 
Parentals are 16 and 26 

contains RGV-DIALPHA and DV-B with Presbyterian mutation 
AR, TS, ROP-, LAC- 
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28. pDL V-10a 
Parental is 27 

insertion of Bam H1 fragment to convert to TR 
AR, TR, ROP*. LAC- 

29. pDLV-16d 
Parental is 28 

contains LACI gene inserted into Not I site LACI gene obtained using following protocol: 

Polymerase chain reaction (PCR) primers containing NOT I sites at their 5' ends were used to amplify the lac I gene 
Following gel purification the gene was inserted into the NOT I site of pDLV-1a AR, TR, ROP-, LAC+ 

Several other plasmids constructs have been designed to facilitate the incorporation of a second beta globin gene 
under regulation of its own TAC promoter. 

30. pDL IV-64a 
Parental is 14 

contains beta globin under regulation of a synthetic TAC promoter 
AR. TS, ROP+, LAC- 

31. pDLIV-67a 

Parental plasmids are 1 4 and 30 

contains D I ALP HA under regulation of one pTAC and DV-B under regulation of a second pTAC 
DV-B is adjacent to D (ALPHA 
AR. TS, ROP-, LAC- 

32. pJR Vl-54a 

Parental plasmids are 14 and 30 

contains DIALPHA and DV-B under regulation of one pTAC and a second DV-B under regulation of another pTAC 
The second DV-B is inserted into the Pvu II site of the plasmid 
AR, TS, ROP-, LAC- 

33. pPL Lambda, 

Commercially available plasmid from Pharmacia LKB (see above) contains pL promoter and coding region for N 
protein of lambda which can be used for expression of fusion or translationally coupled recombinant proteins. 

34. pPL-alpha/beta 

Parental plasmids are 13 and 33 
contains DV-A and DV-B 
AR, ROP+ 

35. pPL-diaipha/beta 
Parental plasmid is 34 
contains RGV- DIALPHA and DV-B 
AR, ROP+ 

36. pSGE0.1-LO 
Parental plasmid is 35 

ROP gene inactivated by insertion of Not I linker into Pvu II site in ROPgene 
AR, ROP- 

37. pSK+ 

Commercially available from Stratagene, LaJolla, Ca. 

38. pGS2488 

Derived from (37) by insertion of synthetic GAP 491 transcriptional initiation site. 

39. pGS2888 

Derived from (38) by conversion of Kpnl site to Sphl site. 
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40. pGS4788 

Derived from (39) by insertion of synthetic GAL UAS into (39) to form GALGAP hybrid promoter. 

41. pLC IIFX-p-globin 

Plasmid available from Kiyoshi Nagai, Medical Research Council, London, England; bears p globin gene 

42. pUC19 

Plasmid commercially available from Bethesda Research Laboratories, Gaithersburg, Maryland 

43. pSUC2-6Z 

Plasmid described by Stetler, etal., Biotechnology, 7:55-60 (1989) 

44. pUCI9p-globin 
Derived from (41 ) and (42) 

45. Plasmid pGS 11 88 

p-globin gene (from 44) is under control of the sigma promoter and the MFX terminator (both from 43). 

46. Plasmid pGS3588 

Derived from (45) and (40); sigma promoter replaced by GALGAP promoter of (40). 

47. Plasmid pGS3888 

Derived from (46) by conversion of Smal site to Xhol site. 
47a. Plasmid pct-MRC 

Plasmid available from K. Nagai, MRC; bears alpha-globin gene. 

48. Plasmid pGS4088 

Derived from (47) by insertion of a-globin gene, replacing the p-globin gene. 

49. Plasmid pSN(+) 

Derived from (37) by changing Kpml site to Notl site. 

50. Plasmid pGS4888 

Derived from (49) by insertion of pGGAP-a-globin expression cassette from (48). 

51. Plasmid pGS1 89 

Derived from (50) by insertion of pGGAP -p-globin expression cassette from (47). Plasmid thus bears both GALGAP 
a globin and GALGAP p globin operons. 

52. Plasmid pC1U 

Plasmid described in Stetler, et al., Biotechnology, 7:55-60 (1989) 

53. Plasmid pClN 

Derived from (52) by addition of Notl site. 

54. Plasmid pGS289 

Derived from (53) by insertion of a-globin and p-globin expression cassettes. 

55. Plasmid pGS389 

Ditto, but with insert orientation reversed. 

56. Plasmid pYRp7 

Plasmid described in Strathern. et al., The Molecular Biology of the Yeast Saccharomvces (Cold Spring Harbon 
1981) 

Source of TRP1 gene. 


fvmr>aa7Ai t 
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57 pClT 

Derived from (52) and (56). 

Replacing Ura3 gene (52) with Trp1 gene (56). 

58 

Derrved from (5") t> insertion of p-globin expression cassette from (47). 

59 PlasmxJ pGS4488 

Derrved from (ST) by insertion of a-globin expression cassette of (48). 

60 PlasmxJ pGS4fc88 

Ditto, but with rY&ert orientation reversed. 

61 . PlavmJ pGS*fi£8 

Derived tium (4>i t > mutton of GGAP-alpha globin expression cassette from (48). 

62 . Plasm«d pGC 1 £09 

Derived from (C ) ty f frmcvai of Pstl site. 

63. Plasmid pGS 1 080 

Derived from by remcval of Spel site. 

64. Plasmd pGS2 1 89 

Derived from (6^ ) and (63) Encodes di-alpha qlobin with RGGV linker. 

65. PlasrtxJ pGS2989 

Derived from (64) and (47). contains di-alpha globin gene and beta globin gene. 

66. Plasmid pGS3C89 

Derived from (53) by insertion of expression cassette of (65). 
66a. Phagescnpt 

Phage commercially available from Stratagene. 

67. Phage Mpp-globin 

Derived from Phagescnpt by insertion of GALGAP promoter and p-qlobin qene from (65). 
67a. Plasmid pGS3089 RGV desp 

Derived from (66) by deletion of Xhol fragment containing p globin expression cassette. 

68. Plasmid pGS3889 

(66) with "Presbyterian" (pN108K) mutation. 

69. PlasidpGS5189 

(66) with "Agenogt" (P.E90K) mutation. 

70. Plasmid pGS5689 

(66) with "Kansas H *(pN102T) mutation. 

71 . Plasmid pGS4989 
(66) with "pV67r mutation. 

72. Plasmid pGS2989 RPV 

(64) with RPV di-alpha linker - - ..... 

73. Plasmid pGS2989 RGV 
(64) wiht RGV di-alpha linker 
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74. Plasmid pGS3089 RGV 

(53) with Hgb expression cassette from (73) 

75. Plasmid pGS3089 RPV 

(53) with Hgb expression cassette from (72) 


TABLE 300 


w 

TABLE OF BACTERIAL STRAINS 



Strain 

Availability 

Expression 


1 

BL21 

Brookhaven 

TAC 

15 

2 

JM109 

Commercial 

TAC 


3 

SCSI 

Commercial 

TAC 


4 

JM110 

ATCC 

TAC 


5 

LE392 

ATCC 

TAC 

20 

6 

23722 

ATCC 

TAC 


7 

W3110 

ATCC 

TAC 


8 

AG-1 

Commercial 

TAC 

25 

9 

DH1 

ATCC 

TAC 


10 

NM554 

Commercial 

TAC 


11 

N99cl+ 

Commercial 

PL 

•3/1 

12 

N4830-1 

Commercial 

PL 


35 


40 


45 


55 


Claims 
1. 


2. 


In a method for the production of a hemoglobin-like protein wherein an alpha globin-like polypeptide and a beta 
globin-like polypeptide are each expressed in transformed, non-erythrocyte cells, the improvement comprising 
expressing the alpha globin and beta globin-like polypeptides in the same cell in such manner that the alpha and 
beta globin-like polypeptides are assembled and combined with heme so as to intracellularly produce a biologically 
functional hemoglobin-like protein. 

The method of claim 1 which comprises (a) providing a yeast cell bearing (i) a gene encoding a human alpha globin- 
like polypeptide, said gene being linked to a promoter functional in said cell, and (ii) a gene encoding a human beta 
globin-like polypeptide, said gene being operably linked to a promoter functional in said cell, and (b) co-expressing 
said alpha globin and beta globin-like genes to obtain alpha globin and beta globin-like polypeptides, said cell folding 
together such polypeptides and incorporating heme to produce a human hemoglobin-like protein in soluble, recov- 
erable form. 

3. The method of claim 2 wherein at least one of said genes is expressed under the control of a GALGAP hybrid 
so promoter. 

4. The method of claim 2 wherein both genes are carried by the same plasmid. 

5. The method of claim 2 wherein said gene is carried by a different plasmid. - - 

6. The method of claim 2 wherein the yeast cell is of the species Saccharomvces cerevisiae . 

7. The method of claim 2 wherein the yeast cell is of a protease-def icient strain. 
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8. The method of claim 2 wherein the hemoglobin-like protein is a non-naturally occurring, mutant hemoglobin. 

9. The method of claim 1 which comprises (a) providing a transformed non-human cell bearing a polycistronic genetic 
unit comprising a first cistron coding for an alpha giobin-like subunit protein and a second cistron coding for a beta 

5 globin-like subunit protein which may precede or follow said first cistron, and a single promoter controlling transcrip- 

tion of said polycistronic genetic unit, said cell being capable of synthesizing heme and incorporating the heme into 
said subunit proteins so as to form a human hemoglobin-like protein, and (b) cultivating the cell under conditions 
suitable for expression of said human hemoglobin-like protein. 

10 10. The method of claim 9 in which the cell is a bacterial cell. 

1 1 . The method of claim 10 in which the bacterial cell is Escherichia colL 

12. The method of claim 1 1 in which the promoter is the Tac promoter. 

15 

13. The method of claim 9 wherein said first and second cistrons are preceded, respectively by first and second intro- 
ductory cistrons, wherein the second cistron may precede or follow the first cistron, and the first and secondary 
introductory cistrons each contain a Shine-Delgarno sequence operably positioned to facilitate translation of the 
following cistron. 

20 

14. The method of claim 10 wherein a substantial portion of the functional human hemoglobin-like protein is found in 
the soluble fraction of the cell. 

15. The method of claim 13 wherein the human hemoglobin-like protein has a lower oxygen affinity than native human 
25 hemoglobin. 


30 


35 


40 


45 


50 


55 
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lys 

leu 

leu 

ser 

his 

cys 

leu 

leu 

val 

thr 

leu 





/Kael 











GCT 

rr-r 


CTG 

CCG 

GCT 

GAA 

TTT 

ACC 

CCG 

GCT 

GTT 

CAT 

GCG 

TCT 

CTG 

CGA 



GAC 

GGC 

CGA 

CTT 

AAA 

TGG 

GGC 

CGA 

CAA 

GTA 

CGC 

AGA 

GAC 

ala 

ala 

his 

leu 

pro 

ala 

glu 

phe 

to m 

thr 

pro 

ala 

val 

his 

ala 

ser 

leu 

GAT 

AAA 

TTC 

0*wm0 i * 

CTG 

GCT 

TCT 

GTT 

TCT 

ACC 

GTT 

CTG 

ACT 

TCG 

AAA 

TAC 

0*0* WT% 

CGT 

CTA 

TTT 

AAG 

0* s 0* 

CAC 

CGA 

AGA 

CAA 

AGA 

TGG 

CAA 

GAC 

TGA 

ACG 

TTT 

ATG 

^*% m 

CGA 

asp 


D K A 

leu 

ala 

ser 

val 

ser 

thr 

val 

leu 

thr 

ser 

lys 

tyr 

arg 





/PstI 







SD*I 



TAA 

TGA 

c 

09*0* 0* 

TGC 

AGC 

TAC 

ATG 

GAG 

ATT 

AAC 

TCA 

ATC 

TAG 

AGG 

GTA 

TTA 

ATT 

ACT 

G 

& 0*0* 
ACG 

TCG 

ATG 

TAC 

CTC 

TAA 

TTG 

AGT 

TAG 

ATC 

ACC 

CAT 

AAT 







SD*2 



/Ndel 





ATA 

ATG 

TAT 

CGC 

TTA 

AAT 

AAG 

GAC 

GAA 

TAA 

CAT 

ATG 

ATC 

GAA 

GGT 

CGT 

ini 

TAC 

CGC 


AAT 

TTA 

TTC 

CTC 

CTT 

ATT 

GTA 

TAC 

TAG 

CTT 

CCA 

GCA 












met (ile 

glu 

gly 

arg; 












FoctorX sequence 











/sacii 





LrX 1 

CAC 

CTG 


CCG 

GAA 

GAA 

AAA 

TCC 

GCG 

GTT 

ACT 

GCT 

CTG 

TGG 

GGT 

CAA 

GTG 

GAC 

TGA 

GGC 

CTT 

CTT 

AAA 

AAG 

CGC T2AA 

TGA 

CGA 

GAC 

ACC 

CCA 

val 

his 

leu 

thr 

pro 

glu 

glu 

lys 

ser 

ala 

val 

thr 

ala 

leu 

trp 

gly 

AAA 

GTG 

AAC 

GTT 

GAC 

GAA 

GTT 

GGT 

GGT 

GAA 

CGT 

CTG 

GGA 

CGT 

CTG 

CTG 

TIT 

CAC 

TTG 

CAA 

CTG 

CTT 

CAA 

CCA 

CCA 

CTT 

CGA 

GAC 

CCT 

GCA 

GAC 

GAC 

xys 

val 

asn 

val 

asp 

glu 

val 

gly 

gly 

glu 

ala 

leu 

gly 

arg 

leu 

leu 














/Bg| II 

GTT 

GTT 

TAC 

CCG 

TCG 

ACT 

CAG 

CGT 

TTC 

TTT 

GAA 

TCT 

TTC 

CGA 

GAT 

CTG 

CAA 

CAA 

ATG 

GGC 

ACC 

TGA 

GTC 

GCA 

AAG 

AAA 

CTT 

AGA 

AAG 

CCT 

CTA 

GAC 

val 

val 

tyr 

pro 

trp 

thr 

gin 

arg 

phe 

phe 

glu 

ser 

phe 

gly 

asp 

leu 

TCT 

ACC 

CCG 

GAC 

GCT 

GTT 

ATG 

GGT 

AAC 

CCG 

AAA 

GTT 

AAA 

GCC 

CAT 

GGT 

AGA 

TGG 

TTC 

CTG 

CGA 

CAA 

TAC 

CCA 

TTG 

GGC 

TTT 

CAA 

TTT 

CGG 

GTA 

CCA 

ser 

thr 

pro 

asp 

ala 

val 

met 

gly 

asn 

pro 

lys 

val 

lys 

ala 

his 

gly 

AAA 

AAA 

GTT 

CTG 

GGT 

GCT 

TTC 

TCT 

GAC 

GGT 

CTG 

GCT 

CAC 

CTG 

GAC 

AAC 

TTT 

TTT 

CAA 

GAC 

CCA 

CGA 

AAG 

AGA 

CTG 

CCA 

GAC 

CGA 

GTG 

GAC 

CTG 

TTG 

lys 

lys 

val 

leu 

gly 

ala 

phe 

ser 

asp 

gly 

leu 

ala 

his 

leu 

asp 

asn 


FIG. 5 co NT. 
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/Kpnl 

CTG AAA CGT ACC TTC GCT ACT CTG 
GAC TTT CCA TGG AAG CGA TGA GAC 
leu lys gly thr phe ala thr leu 

CAC GTT GAC CCG GAA AAC TTC CGT 
GTG CAA CTG GGC CTT TTG AAG GCA 

his val asp pro glu asn phe arg 

GTT CTG GCT CAC CAC TTC GGT AAA 
CAA GAC CGA GTG GTG AAG CCA TTT 
val leu ala his his phe gly lys 

GCT TAC CAG AAA GTT GTT GCT GGT 
CGA ATG GTC TTT CAA CAA CGA CCA 
ala tyr gin lys val val ala gly 

/Hindi I I 

TAC CAC TAA TGA 

ATG GTG ATT ACT TCG A 

tyr his 


/Sad 

TCT GAG CTC CAC TGC GAC AAA CTG 
AGA CTC GAG GTG ACG CTG TTT GAC 

ser glu leu his cys asp lys leu 

/Spel 

CTG CTG GGT AAC GTA CTA GTT TGC 

GAC GAC CCA TTG CAT GAT CAA ACG 

leu leu gly asn val leu val cys 

/EcoRI 

GAA TTC ACT CCG CCG GTT CAG GCT 

CTT AAG TGA GGC GGC CAA GTC CGA 

glu phe thr pro pro val gin ala 

GTT GCT AAC GCG CTA GCT CAC AAA 
CAA CGA TTG CGC GAT CGA GTG TTT 
val ala asn ala leu ala his lys 


FIG. 5. cont. 
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FIG 7 


Hemoglobin Beth Israe 


1 (beta asn — >ser) : 

/Sad 

C CAC TGC GAC AAA CTG 
TC GAG GTG ACG CTG TTT GAC 
leu his cys asp lys leu 


/Spel 

CAC GTT GAC CCG GAA tct TTC CGT CTG CTG GGT AAC GTA 

GTG CAA CTG GGC CTT aga AAG GCA GAC GAC CCA TTG CAT GAT C 

his val asp pro glu SER phe arg leu leu gly asn val leu 

Hemoglobin Cheverly (beta phe — >ser) : 

/Serll 

CG GTT ACT GCT CTG TGG GGT 
G CGC CAA TGA CGA GAC ACC CCA 

ala val thr ala leu trp gly 

AAA GTC AAC GTT GAC GAA GTT GGT GGT GAA GCT CTG GGA CGT CTG CTG 
TTT CAC TTG CAA CTG CTT CAA CCA CCA CTT CGA GAC CCT GCA GAC GAC 
lys val asn val asp glu val gly gly glu ala leu gly arg leu leu 

/Bglll 

GTT GTT TAC CCG TGG ACT CAG CGT TTC TTT, GAA TCT tct GCA 

CAA CAA ATG GGC ACC TGA GTC GCA AAG AAA CTT AGA aga CCT CTA G 

val val tyr pro trp thr gin arg phe phe glu ser SER gly asp 

Hemoglobin beta val — >ile: 

/Ncol 
CAT GCT 
CA 

gly 

AAA AAA ate CTG GGT GCT TTC TCT GAC GGT CTG GCT CAC CTG GAC AAC 
TTT TTT tag GAC CGA CCA AAG AGA CTG CCA GAC CGA GTG GAC CTG TTG 
lys lys lie leu gly ala phe ser asp gly leu ala his leu asp asn 

/Kpnl 

CTG AAA GGT AC 
GAC TTT C 
leu lys gly 

Hemoglobin Kansas (beta asn — >thr) 

/SacI 

C CAC TGC GAC AAA CTG 
TC GAG GTG ACG CTG TTT GAC 
glu leu his cys asp lys leu 

/Spel 

CAC GTT GAC CCG GAA acc TTC CGT CTG CTG GGT AAC GTA 

GTG CAA CTG GGC CTT tgg AAG GCA GAC GAC CCA TTG CAT GAT C 

his val asp pro glu asn phe arg leu leu gly asn val leu val 
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CO 
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/ecori /Xmal A 

A ATT CGA GCT CGG TAC CCG GGC TAC ATG GAG 









GCT 

CGA 

GCC 

ATG 

GGC 

CCG 





SD*I 








ATT 

AAC 

TCA 

ATC 

rp ■% f+ 
IACj 

AGG 

GTA 

TTA 

ATA 

ATG 

TAT 

CGC 

TTA 

TAA 

TTG 

AGT 

TAG 

ATC 

TCC 

CAT 

AAT 

TAT 

TAC 

ATA 

GCG 

AAT 










met 

tyr 

arg 

leu 

GAA 


/Ndel 







/Eagl 


TAA 

CAT 

ATG 

CTG 

TCT 

CCG 

GCC 

GAT 

AAA 

ACC 

AAC 

GTT 

CTT 

ATT 

GTA 

TAC 

GAC 

AGA 

GGC 

CGG 

CTA 

TTT 

TGG 

TTG 

CAA 

glu 



met 

leu 

ser 

pro 

ala 

asp 

lys 

thr 

asn 

val 

TGG 

GGT 

AAA 

GTT 



CAC 

GCT 

GGT 

GAA 

TAC 

GGT 

GCT 

ACC 

CCA 

TTT 

CAA 

CCG 

CGC 

GTG 

CGA 

CCA 

CTT 

ATG 

CCA 

CGA 

trp 

gly 

lys 

val 

gly 

ala 

his 

ala 

gly 

glu 

tyr 

gly 

ala 

GAG 

CGT 

ATG 

TTC 

CTG 

TCT 

TTC 

CCG 

ACC 

ACC 

AAA 

ACC 

TAC 

CTC 

GCA 

TAC 

AAG 

GAC 

AGA 

AAG 

GGC 

TGG 

TGG 

TTT 

TGG 

ATG 

glu 

arg 

met 

phe 

leu 

ser 

phe 

pro 

thr 

thr 

lys 

thr 

tyr 

TTC 







/MstI 





GAC 

CTG 

TCT 

CAC 

GGT 

TCT 

CGC 

GAG 

GTT 

AAA 

GGT 

CAC 

AAG 

CTG 

GAC 

AGA 

GTG 

CCA 

AGA 

GCG 

GTC 

CAA 

TTT 

CCA 

GTG 

phe 

asp 

leu 

ser 

his 

gly 

ser 

ala 

gin val 

lys 

gly 

his 

GTT 

GCT 

GAT 

GCT 

CTG 

ACC 

AAC 

GCT 

GTT 

GCT 

CAC 

GTT 

GAT 

CAA 

CGA 

CTA 

CGA 

GAC 

TGG 

TTG 

CGA 

CAA 

CGA 

GTG 

CAA 

CTA 

val 

ala 

asp 

ala 

leu 

thr 

asn 

ala 

val 

ala 

his 

val 

asp 


SD*2 


/Xhol 


/Mlu I 

AAC GCG TTG TCT GCT CTG TCT GAT CTG CAC GCT CAC AAA CTG CGT GTT 
TTG CGC AAC AGA CGA GAC AGA CTA GAC GTG CGA GTG TTT GAC GCA CAA 

asn ala leu ser ala leu ser asp leu his ala his lys leu arg val 
/Hpal 

GAT CCG GTT AAC TTC AAA CTG CTG TCT CAC TGC CTG CTG GTT ACT CTG 
CTA GGC CAA TTG AAG TTT GAC GAC AGA GTG ACG GAC GAC CAA TGA GAC 
asp pro val asn phe lys leu leu ser his cys leu leu val thr leu 

/NaeX 

GCT GCT CAT CTG CCG GCT GAA TTT ACC CCG GCT GTT CAT GCG TCT CTG 
CGA CGA GTA GAC GGC CGA CTT AAA TGG GGC CGA CAA GTA CGC AGA GAC 

ala ala his leu pro ala glu phe thr pro ala val his ala ser leu 

/BstBI 

GAT TTC CTG GCT TCT GTT TCT ACC GTT CTG ACT TCG AAA TAC CGT 

CTA TTT AAG CAC CGA AGA CAA AGA TGG CAA GAC TGA AGC TTT ATG GCA 

asp lys phe leu ala ser val ser thr val leu thr ser lys tyr arg 


FIG 12 
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/EagI 

cS cS SI SS IS ESS GAT *** ACC AAC GTT AAA GOT GCT 

CCA CCA CAA GAC AGA GGC CGG CTA TTT TGG TTG CAA TTT CGA CGA 

Sg-jgX val leu ser pro ala asp l ys thr asn val iyl ala ala 

let ecl m SI SS S 2E GCT GGT GAA TAC GGT GCT GAA GCT eg?* 

trp gj? Iyl St S5 SIS 5* ^ S* ATG CCA CGA OT CGA G AG 
gxy lys vai gly ala his ala gly g i u tyr gly ala glu ala leu 

Sc SS Jac aIg IS H? SS acc acc *** ACC TAC m CCG CAC 

alu I A f ? AC AGA AAG GGC TGG TGG TTT TGG ATG AAG GGC GTG 

glu arg met phe leu ser phe pro thr thr lys thr tyr phe pro his 

Hg S£ S? ISI S£ SS T CT GCgM ^ G GTT AAA GGT CAC GGT AAA AAA 
AAG CTG GAC AGA GTG CCA AGA GCG GTC CAA TTT CCA GTG CCA TTT TTT 
Phe asp leu ser his gly ser ala gin val lys glj gly JJJ TJI 

SI SS SI SI Sr iS JS£ GCT GTT GCT CAC GTT GAT GAT ATG CCG 

val SS S« f? A ?* C ?° 1X0 CGA CGA GTG CAA CTA CTA TAC GGC 

val ala asp ala leu thr asn ala val ala his val asp asp met pro 

/Mlu I 

WG CGC US IS SS S G TCT GAT CTG CAC GCT CAC AAA CTG CGT GTT 

alS S5f ?£S « i f? A GAC AGA CTA GAC GTG ^ GTG TTT GAC GCA CAA 
asn ala leu ser ala leu ser asp leu his ala his lys leu arg val 

/Hpal 

SI GGC SI iJr US ™ S G CTG TCT CAC TGC CTG CTG GTT ACT CTG 

«p pro Si H» 15* f AC GAC AGA GTG ACG GAC GA C CAA TGA GAC 

asp pro val asn phe lys leu leu ser his cys leu leu val thr leu 

/Nael 

SS SS 5^ T 0X6 CCG GCT GAA TTT ACC CCG GCT GTT CAT GCG TCT CTG 

ala S Sis ? A n S GC C ? A S* M TGG GGC CGA cS SI CGC IS SS 
ala ala his leu pro ala glu phe thr pro ala val his ala ser leu 

SI TTT Hg Sc 22 IS SI IS ACC GTT CTG ACT ~"25 TAC CGT 
aso lvl r^S ?* C °? A AGA ^ TGG CAA GAC TGA AGC TTT ATG GCA 

asp lys p h e leu ala ser val ser thr val leu thr ser lys tyr arg 


TAA TGA 
ATT ACT 


C TGC A 
G 


/Pstl 


B 


SD.I 


GC TAC ATG GAG ATT AAC TCA 
ACG TCG ATG TAC CTC TAA TTG AGT 


SDg2 


/Ndel 


55 5S S A Hi JK ATG TAT CGC WA AAG GAG GA A TAA CAT 

TAG ATC ACC CAT AAT TAT TAC ATA GCG AAT TTA TTC CTC CTT ATT GTA 

met tyr arg keu asn lys glu glu 


FIG.I2 cont. 
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/SacII 


ATG 

CAC 

CTG 

ACT 

CCG 

GAA 

GAA 

AAA 

TCC 

GCG 

GTT 

ACT 

GCT 

CTG 

TGG 

GGT 

TAC 

GTG 

GAC 

TGA 

GGC 

CTT 

CTT 

AAA 

AGG 

CGC 

CAA 

TGA 

CGA 

GAC 

ACC 

CCA 

met 

his 

leu 

thr 

pro 

glu 

glu 

lys 

ser 

ala 

val 

thr 

ala 

leu 

trp 

gly 

AAA 

GTG 

AAC 

GTT 

GAC 

GAA 

GTT 

GGT 

GGT 

GAA 

GCT 

CTG 

GGA 

CGT 

CTG 

CTG 

TTT 

CAC 

TTG 

CAA 

CTG 

CTT 

CAA 

CCA 

CCA 

CTT 

CGA 

GAC 

CCT 

GCA 

GAC 

GAC 

lys 

val 

asn 

val 

asp 

glu 

val 

gly 

gly 

glu 

ala 

leu 

gly 

arg 

leu 

leu 














/Bgl II 

GTT 

GTT 

TAC 

CCG 

TGG 

ACT 

CAG 

CGT 

TTC 

TTT 

GAA 

TCT 

TTC 

GGA 

GAT 

CTG 

CAA 

CAA 

ATG 

GGC 

ACC 

TGA 

GTC 

GCA 

AAG 

AAA 

CTT 

AGA 

AAG 

CCT 

CTA 

GAC 

val 

val 

tyr 

pro 

trp 

thr 

gin 

arg 

phe 

phe 

glu 

ser 

phe 

gly 

asp 

leu 














/Ncol 

TCT 

ACC 

CCG 

GAC 

GCT 

GTT 

ATG 

GGT 

AAC 

CCG 

AAA 

GTT 

AAA 

GCC 

CAT 

GGT 

A6A 

TGG 

TTC 

CTG 

CGA 

CAA 

TAC 

CCA 

TTG 

GGC 

TTT 

CAA 

TTT 

CGG 

GTA 

CCA 

ser 

thr 

pro 

asp 

ala 

val 

met 

gly 

asn 

pro 

lys 

val 

lys 

ala 

his 

gly 

AAA 

AAA 

GTT 

CTG 

GGT 

CGT 

TTC 

TCT 

GAC 

GGT 

CTG 

GCT 

CAC 

CTG 

GAC 

AAC 

TTT 

TTT 

CAA 

GAC 

CCA 

CGA 

AAG 

AGA 

CTG 

CCA 

GAC 

CGA 

GTG 

GAC 

CTG 

TTG 

lys 

lys 

val 

leu 

gly 

ala 

- 

phe 

ser 

asp 

gly 

ij ~ j 

leu 

ala 

his 

leu 

asp 

asn 




/Kpnl 






/Sad 





CTG 

AAA 

GGT 

ACC 

TTC 

GCT 

ACT 

CTG 

TCT 

GAG 

CTC 

CAC 

TGC 

GAC 

AAA 

CTG 

GAC 

TTT 

CCA 

TGG 

AAG 

CGA 

TGA 

GAC 

AGA 

CTC 

GAG 

GTG 

ACG 

CTG 

TTT 

GAC 

leu 

lys 

gly 

thr 

phe 

ala 

thr 

leu 

ser 

glu 

leu 

his 

cys 

asp 

lys 

leu 

CAC 













/Spel 


GTT 

GAC 

CCG 

GAA 

AAC 

TTC 

CGT 

CTG 

CTG 

GGT 

AAC 

GTA 

CTA 

GTT 

TGC 

GTG 

CAA 

CTG 

GGC 

CTT 

TTG 

AAG 

GCA 

GAC 

GAC 

CCA 

TTG 

CAT 

GAT 

CAA 

ACG 

his 

val 

asp 

pro 

glu 

asn 

phe 

arg 

leu 

leu 

gly 

asn 

val 

leu 

val 

cys 

GTT 







/EcoRI 








CTG 

GCT 

CAC 

CAC 

TTC 

GGT 

AAA 

GAA 

TTC 

ACT 

CCG 

CCG 

GTT 

CAG 

GCT 

CAA 

GAC 

CGA 

GTG 

GTG 

AAG 

CCA 

TTT 

CTT 

AAG 

TGA 

GGC 

GGC 

CAA 

GTC 

CGA 

val 

leu 

ala 

his 

his 

phe 

gly 

lys 

glu 

phe 

thr 

pro 

pro 

val 

gin 

ala 

GCT 

TAC 

CAG 

AAA 

GTT 

GTT 

GCT 

GGT 

GTT 

GCT 

AAC 

GCG 

CTA 

GCT 

CAC 

AAA 

CGA 

ATG 

GTC 

TTT 

CAA 

CAA 

CGA 

CCA 

CAA 

CGA 

TTG 

CGC 

GAT 

CGA 

GTG 

TTT 

ala 

tyr 

gin 

lys 

val 

val 

ala 

gly val 

ala 

asn 

ala 

leu 

ala 

his 

lys 


/Hindlll 

TAC CAC TAA TGA 

ATG GTG ATT ACT TCG A 

tyr his 


FIG. 12 co NT. 
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PTACJ 
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Translational Coupler 

_SD 2 MET STf 

5* AAT AAG GAG GAA TAA CAT ATG CTG TCT CCG GCC GAT 
TTTATTC CTC CTTATTGTA TAC GAC AGA GGC CGG CT A 

£ag I 

AAG GCC CCA AGC TTG GGG 3* 
TTC CGG GG T TCG AAC CCC 5? 
Hind III 
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pPL- LAMBDA- E 



Hnd3 


FIG. l9 cont. 
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RGV LINKER Hind 3 

•Eagl 


Bst b! ALPHA AND BETA 6L0BIN 
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+ synthetic GAP 
sequence 



FIG. 21 a 


Kpnl Blunt end withT4 

Polymerose, religate with 
" Sphl linkers 

PGS2888 
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